Chapter3 - Basic Processing Unit
Chapter3 - Basic Processing Unit
Chapter3 - Basic Processing Unit
TOPICS COVERED: 1.Fundamental concepts T1 7.1 pp 412-420 2. Execution of complete instruction T1 7.2 pp 421-422 3. Multiple bus organization T1 7.3 pp 423-425
Overview
Instruction Set Processor (ISP) Central Processing Unit (CPU) A typical computing task consists of a series of steps specified by a sequence of machine instructions that constitute a program. An instruction is executed by carrying out a sequence of more rudimentary operations.
Fundamental Concepts
Processor fetches one instruction at a time and perform the operation specified. Instructions are fetched from successive memory locations until a branch or a jump instruction is encountered. Processor keeps track of the address of the memory location containing the next instruction to be fetched using Program Counter (PC). Instruction Register (IR)
Executing an Instruction
Fetch the contents of the memory location pointed to by the PC. The contents of this location are loaded into the IR (fetch phase). IR [[PC]] Assuming that the memory is byte addressable, increment the contents of the PC by 4 (fetch phase). PC [PC] + 4 Carry out the actions specified by the instruction in the IR (execution phase).
Processor Organization
Instruction Address lines decoder and MAR control logic Memory bus MDR Data lines IR Y Constant 4 R0 Select MUX Add ALU control lines Sub A B R n - 1 ALU Carry -in XOR Z TEMP
PC
Datapath
Executing an Instruction
Transfer a word of data from one processor register to another or to the ALU. Perform an arithmetic or a logic operation and store the result in a processor register. Fetch the contents of a given memory location and load them into a processor register. Store a word of data from a processor register into a given memory location.
Register Transfers
Internal processor bus Riin Ri
Riout Yin
Y
Constant 4 Select MUX A ALU Zin Z B
Zout
Figure 7.2. Input and output gating for the registers in Figure 7.1.
Bus
Register Transfers
D Q 1 Q Riout Ri in Clock
Figure 7.3. Input and output g for one gister bit. ating re
All operations and data transfers are controlled by the processor clock.
Figure 7.3. Input and output gating for one register bit.
The ALU is a combinational circuit that has no internal storage. ALU gets the two operands from MUX and bus. The result is temporarily stored in register Z. What is the sequence of operations to add the contents of register R1 to those of R2 and store the result in R3?
1. 2.
3.
The response time of each memory access varies (cache miss, memory-mapped I/O,). To accommodate this, the processor waits until it receives an indication that the requested operation has been completed (Memory-Function-Completed, MFC). Move (R1), R2
MAR [R1] Start a Read operation on the memory bus Wait for the MFC response from the memory Load MDR from the memory bus R2 [MDR]
out
Timing
MAR [R1] Assume MAR is always available on the address lines of the memory bus.
Wait for the MFC response from the memory Load MDR from the memory bus
R2 [MDR]
Add (R3), R1 Fetch the instruction Fetch the first operand (the contents of the memory location pointed to by R3) Perform the addition Load the result into R1
Architecture
Internal processor bus Riin Ri
Riout Yin
Y
Constant 4 Select MUX A ALU Zin Z B
Zout
Figure 7.2. Input and output gating for the registers in Figure 7.1.
Step 1 2 3 4 5 6 7
Action
Memory bus
PCout , MAR in , Read, Select4, dd, Zin A Zout , PCin , Y in , WMF C MDR out , IR in
MDR
Data lines
IR
Constant 4
R0
Select
MUX
Add
Sub
R n - 1
ALU
Figure 7.6. Control sequence executionof the instruction Add (R3),R1. for
Add (R3), R1
A branch instruction replaces the contents of PC with the branch target address, which is usually obtained by adding an offset X given in the branch instruction. The offset X is usually the difference between the branch target address and the address immediately following the branch instruction. Conditional branch
Step Action
1 2 PCout , MAR in , Read, Select4,Add, Z in Zout , PCin , Yin , WMF C
3
4 5
MDR out , IR in
Offset-field-of-IR , Add, Z in out Z out , PCin , End
Multiple-Bus Organization
Bus A Bus B Bus C
Incrementer PC Re gister f ile Constant 4
MUX
ALU
Instruction decoder
IR
MDR
MAR
Address lines
Multiple-Bus Organization
Step Action 1 2 3 4 PCout, R=B, MAR in , Read, IncPC WMFC MDR outB , R=B, IR in R4outA , R5outB , SelectA, Add, R6in , End
Figure 7.9. Control sequence for the instruction. Add R4,R5,R6, for the three-bus organization in Figure 7.8.
Quiz
IR
Y Constant 4 R0
Select
MUX Add
Sub
A ALU
R n - 1
What is the control sequence for execution of the instruction Add R1, R2 including the instruction fetch phase? (Assume single bus architecture)
Hardwired Control
Overview
To execute instructions, the processor must have some means of generating the control signals needed in the proper sequence. Two categories: hardwired control and microprogrammed control Hardwired system can operate at high speed; but with little flexibility.
Control signals
Clock
External inputs
Generating Zin
Zin = T1 + T6 ADD + T4 BR +
Branch
T4 Add T6
T1
Figure 7.12. Generation of the Zin control signal for the processor in Figure 7.1.
Generating End
Branch<0 Add N T7 T5 N T4 T5 Branch End
Instruction unit
Floating-point unit
A Complete Processor
Bus interf ace
Instruction cache
Data cache
Processor
Sy stem us b
Main memory
Input/ Output
Microprogrammed Control
Overview
MDRout PCin
1 2 3 4 5 6 7
0 1 0 0 0 0 0
1 0 0 0 0 0 0
1 0 0 1 0 0 0
1 0 0 1 0 0 0
0 0 1 0 0 1 0
0 0 1 0 0 0 0
0 1 0 0 1 0 0
1 0 0 0 0 0 0
1 0 0 0 0 1 0
1 0 0 0 0 1 0
0 1 0 0 0 0 1
0 0 0 0 1 0 0
R1in
Add
Z out
IRin
0 0 0 0 0 0 1
0 0 0 1 0 0 0
0 1 0 0 1 0 0
Control signals are generated by a program similar to machine language programs. Control Word (CW); microroutine; microinstruction
Figure 7.15 An e xample of microinstructions for Figure 7.6.
End
Yin
Zin
Micro instruction
WMFC
MAR in
Select
Read
PCout
R1 out
R3 out
0 0 0 0 0 0 1
Step
Action PCout , MAR in , Read, Select4, dd, Zin A Zout , PCin , Y in , WMF C MDR out , IR in R3out , MAR in , Read R1out , Y in , WMF C MDR out , SelectY,Add, Zin Zout , R1 in , End
Overview
1 2 3 4 5 6 7
Figure 7.6. Control sequence executionof the instruction Add (R3),R1. for
IR
Overview
Clock
PC
Control store
CW
Control store
One function cannot be carried out by this simple organization.
Overview
The previous organization cannot handle the situation when the control unit is required to check the status of the condition codes or external inputs to choose between alternative courses of action. Use conditional branch microinstruction.
Address Microinstruction
0
1
MDRout , IR in
3 Branch to starting address appropriatemicroroutine of . ... .. ... ... .. ... .. ... ... .. ... ... .. ... .. ... ... .. ... .. ... ... .. ... .. 25 26 27 If N=0, then branch to microinstruction0 Offset-field-of-IRout , SelectY, Add, Z in Zout , PCin , End
Overview
External inputs IR Starting and branch address generator Condition codes
Clock
PC
Control store
CW
Figure 7.18.
Microinstructions
A straightforward way to structure microinstructions is to assign one bit position to each control signal. However, this is very inefficient. The length can be reduced: most signals are not needed simultaneously, and many signals are mutually exclusive. All mutually exclusive signals are placed in the same group in binary coding.
Further Improvement
Enumerate the patterns of required signals in all possible microinstructions. Each meaningful combination of active control signals can then be assigned a distinct code. Vertical organization Horizontal organization
Microprogram Sequencing
If all microprograms require only straightforward sequential execution of microinstructions except for branches, letting a PC governs the sequencing would be efficient. However, two disadvantages:
Having a separate microroutine for each machine instruction results in a large total number of microinstructions and a large control store. Longer execution time because it takes more time to carry out the required branches.
Example: Add src, Rdst Four addressing modes: register, autoincrement, autodecrement, and indexed (with indirect forms).
Mode
Contents of IR
OP code
0 1 11 10
0 87
Rsrc 4 3
Rdst 0
Address (octal)
Microinstruction
000
001 002 003 121 122 123 170 171 172 173
Rsrc , MARin , Read, Select4, Add,in Z out Zout, Rsrc in Branch { PC 170;PC [IR8]}, WMFC 0 MDRout, MARin, Read, WMFC MDRout, Yin Rdst , SelectYAdd, Zin , out Zout, Rdst , End in
Figure 7.21. Microinstruction for Add (Rsrc)+,Rdst. Note:Microinstruction at location 170 is not executed for this addressing mode.
The microprogram we discussed requires several branch microinstructions, which perform no useful operation in the datapath. A powerful alternative approach is to include an address field as a part of every microinstruction to indicate the location of the next microinstruction to be fetched. Pros: separate branch microinstructions are virtually eliminated; few limitations in assigning addresses to microinstructions. Cons: additional bits for the address field (around 1/6)
Figure 7.24. Implementation of the microroutine of Figure 7.21 using a next-microinstruction address field. Figure 7.23 for encoded signals.) (See
bit-ORing
Further Discussions
Prefetching Emulation