Unit IV IO Notes
Unit IV IO Notes
Unit IV IO Notes
INPUT/OUTPUT ORGANIZATION
Introduction
A general purpose computer should have the ability to exchange information with a wide
range of devices in varying environments. Computers can communicate with other computers
over the Internet and access information around the globe. They are an integral part of
home appliances, manufacturing equipment, transportation systems, banking and point-of-
sale terminals. In this chapter, we study the various ways in which I/O operations are
performed.
A single-bus structure
Consider, for instance, with memory-mapped I/O, if DATAIN is the address of the
input buffer of the keyboard
Move DATAIN, R0
And DATAOUT is the address of the output buffer of the display/printer
Move R0, DATAOUT
1
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
This sends the contents of register R0 to location DATAOUT, which may be the output data
buffer of a display unit or a printer.
Most computer systems use memory-mapped I/O. Some processors have special I/O
instructions to perform I/O transfers. The hardware required to connect an I/O device to
the bus is shown below:
The address decoder enables the device to recognize its address when this address
appears on the address lines. The data register holds the data. The status register contains
information. The address decoder, data and status registers and controls required to
coordinate I/O transfers constitutes interface circuit
For eg: Keyboard, an instruction that reads a character from the keyboard should be
executed only when a character is available in the input buffer of the keyboard interface.
The processor repeatedly checks a status flag to achieve the synchronization between
processor and I/O device, which is called as program-controlled I/O.
• Interrupts and
• Direct memory access
Interrupts: synchronization is achieved by having the I/O device send a special signal over
the bus whenever it is ready for a data transfer operation.
Direct memory access: For high speed I/O devices. The device interface transfer data
directly to or from the memory without informing the processor.
2
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
4.2 Interrupts
There are many situations where other tasks can be performed while waiting for an I/O
device to become ready. A hardware signal called an Interrupt will alert the processor when
an I/O device becomes ready. Interrupt-request line is usually dedicated for this purpose.
For example, consider, COMPUTE and PRINT routines. The routine executed in response to
an interrupt request is called interrupt-service routine. Transfer of control through the use
of interrupts happens. The processor must inform the device that its request has been
recognized by sending interrupt-acknowledge signal. One must therefore know the
difference between Interrupt Vs Subroutine. Interrupt latency is concerned with saving
information in registers will increase the delay between the time an interrupt request is
received and the start of execution of the interrupt-service routine.
Interrupt hardware
Most computers have several I/O devices that can request an interrupt. A single
interrupt request line may be used to serve n devices.
3
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
Vectored interrupts
A device requesting an interrupt may identify itself (by sending a special code)
directly to the processor, so that the processor considers it immediately.
Interrupt nesting
Simultaneous requests
The processor must have some mechanisms to decide which request to service when
simultaneous requests arrive. Here, daisy chain and arrangement of priority groups as the
interrupt priority schemes are discussed. Priority based simultaneous requests are
considered in many organizations.
At the device end, an interrupt enable bit determines whether it is allowed to generate an
interrupt request. At the processor end, it determines whether a given interrupt request
will be accepted.
Exceptions
The term exception is used to refer to any event that causes an interruption.
Hence, I/O interrupts are one example of an exception.
• Recovery from errors – These are techniques to ensure that all hardware
components are operating properly.
• Debugging – find errors in a program, trace and breakpoints (only at specific points
selected by the user).
• Privilege exception – execute privileged instructions to protect OS of a computer.
4
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
As we have seen earlier, the two commonly used mechanisms for implementing I/O
operations are:
• Interrupts and
• Direct memory access
Interrupts: synchronization is achieved by having the I/O device send a special signal over
the bus whenever it is ready for a data transfer operation
Direct memory access:
Basically for high speed I/O devices, the device interface transfer data directly to or
from the memory without informing the processor. When interrupts are used, additional
overhead involved with saving and restoring the program counter and other state
information. To transfer large blocks of data at high speed, an alternative approach is used.
A special control unit will allow transfer of a block of data directly between an external
device and the main memory, without continuous intervention by the processor.
DMA controller is a control circuit that performs DMA transfers, is a part of the I/O
device interface. It performs functions that normally be carried out by the processor. DMA
controller must increment the memory address and keep track of the number of transfers.
The operations of DMA controller must be under the control of a program executed by the
processor. To initiate the transfer of block of words, the processor sends the starting
address, the number of words in the block and the direction of the transfer. On receiving
this information, DMA controller transfers the entire block and informs the processor by
raising an interrupt signal. While a DMA transfer is taking place, the processor can be used
to execute another program. After the DMA transfer is completed, the processor can
return to the program that requested the transfer.
5
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
A conflict may arise if both the processor and a DMA controller or two DMA
controllers try to use the bus at the same time to access the main memory. To resolve this,
an arbitration procedure is implemented on the bus to coordinate the activities of all
devices requesting memory transfers.
Bus Arbitration
The device that is allowed to initiate data transfers on the bus at any given time is
called the bus master. Arbitration is the process by which the next device to become the
bus master is selected and bus mastership is transferred to it. The two approaches are
centralized and distributed arbitrations.
In Centralized arbitration, A simple arrangement for bus arbitration using a daisy chain
shows the arbitration solution. A rotating priority scheme may be used to give all devices an
equal chance of being serviced (BR1 to BR4). In Distributed arbitration, all devices waiting
to use the bus have equal responsibility in carrying out the arbitration process, without
using a central arbiter. The drivers are of the open-collector type. Hence, if the input to
one driver is equal to 1 and the input to another driver connected to the same bus line is
equal to 0 the bus will be in the low-voltage state. This uses ARB0 to ARB3.
4.5 Buses
The Primary function of the bus is to provide a communication path for the transfer
of data. It must also look in to,
–When to place information on the bus?
–When to have control signals?
Some bus protocols are set. These involve data, address and control lines. A variety of
schemes have been devised for the timing of data transfers over a bus. They are:
Bus master is an initiator. Usually, processor acts as master. But under DMA setup, any
other device can be master. The device addressed by the master is slave or target.
Synchronous bus
All devices derive timing information from a common clock line. Equally spaced pulses
on this line define equal time intervals. Each of these intervals constitutes a bus cycle
during which one data transfer can take place. Timing of an input/output transfer on a
synchronous bus is shown in figure 4.23.
6
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
Asynchronous bus
This is a scheme based on the use of a handshake between the master and the slave for
controlling data transfers on the bus. The common clock is replaced by two timing control
lines, master-ready and slave-ready. The first is asserted by the master to indicate that it
is ready for a transaction and the second is a response from the slave. The master places
the address and command information on the bus. It indicates to all devices that it has done
so by activating the master-ready line. This causes all devices on the bus to decode the
address. The selected slave performs the required operation and informs the processor it
has done so by activating the slave-ready line. A typical handshake control of data transfer
during an input and an output operations are shown in figure 4.26 and 4.27 respectively. The
master waits for slave-ready to become asserted before it removes its signals from the
bus. The handshake signals are fully interlocked. A change of state in one signal is followed
by a change in the other signal. Hence this scheme is known as a full handshake.
A parallel port transfers data in the form of a number of bits (8 or 16) simultaneously
to or from the device. A serial port transmits and receives data one bit at a time.
Communication with the bus is the same for both formats. The conversion from the parallel
to the serial format, and vice versa, takes place inside the interface circuit. In parallel port,
the connection between the device and the computer uses a multiple-pin connector and a
cable with as many wires. This arrangement is suitable for devices that are physically close
to the computer. In serial port, it is much more convenient and cost-effective where longer
cables are needed.
7
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
Parallel Port
A detailed figure showing the input interface circuit is presented in figure 4.29. Now,
consider the circuit for the status flag (figure 4.30). An edge-triggered D flip-flop is used
along with read-data and master-ready signals
8
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
–Slave-ready
–R/~W
–Master-ready
–Address decoder
–Handshake control
The input and output interfaces can be combined into a single interface. The general
purpose parallel interface circuit that can be configured in a variety of ways. For
increased flexibility, the circuit makes it possible for some lines to serve as inputs and
some lines to serve as outputs, under program control.
Serial Port
A serial interface circuit involves – Chip and register select, Status and control,
Output shift register, DATAOUT, DATAIN, Input shift register and Serial
input/output – as shown in figure 4.37.
The topics discussed under PCI are: Data Transfer, Use of a PCI bus in a computer
system, A read operation on the PCI bus, Device configuration and Other electrical
characteristics. Use of a PCI bus in a computer system is shown in figure 4.39 as a
representation.
Host, main memory and PCI bridge are connected to disk, printer and Ethernet
interface through PCI bus. At any given time, one device is the bus master. It has the
right to initiate data transfers by issuing read and write commands. A master is called
an initiator in PCI terminology. This is either processor or DMA controller. The
addressed device that responds to read and write commands is called a target. A
complete transfer operation on the bus, involving an address and a burst of data, is
called a transaction. Device configuration is also discussed.
9
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
The PCI bus is a good example of a system bus that grew out of the need for
standardization.
It supports the functions found on a processor bus bit in a standardized format that is
independent of any particular processor.
Devices connected to the PCI bus appear to the processor as if they were connected
directly to the processor bus. They are assigned addresses in the memory address space
of the processor.
The PCI follows a sequence of bus standards that were used primarily in IBM PCs. Early
PCs used the 8-bit XT bus, whose signals closely mimicked those of Intel’s 80x86
processors. Later, the 16-bit bus used on the PC At computers became known as the ISA
bus. Its extended 32-bit version is known as the EISA bus.
Other buses developed in the eighties with similar capabilities are the Microchannel
used in IBM PCs and the NuBus used in Macintosh computers.
The PCI was developed as a low-cost bus that is truly processor independent. Its design
anticipated a rapidly growing demand for bus bandwidth to support high-speed disks and
graphic and video devices, as well as the specialized needs of multiprocessor systems.
As a result, the PCI is still popular as an industry standard almost a decade after it was
first introduced in 1992.
An important feature that the PCI pioneered is a plug-and-play capability for
connecting I/O devices. To connect a new device, the user simply connects the device
interface board to the bus. The software takes care of the rest.
Data Transfer
In today’s computers, most memory transfers involve a burst of data rather than just
one word. The reason is that modern processors include a cache memory. Data are
transferred between the cache and the main memory in burst of several words each.
The words involved in such a transfer are stored at successive memory locations. When
the processor (actually the cache controller) specifies an address and requests a read
operation from the main memory, the memory responds by sending a sequence of data
words starting at that address. Similarly, during a write operation, the processor sends
a memory address followed by a sequence of data words, to be written in successive
memory locations starting at the address.
The PCI is designed primarily to support this mode of operation. A read or write
operation involving a single word is simply treated as a burst of length one.
The bus supports three independent address spaces: memory, I/O, and
configuration. The first two are self-explanatory. The I/O address space is intended
for use with processors, such as Pentium, that have a separate I/O address space.
However, as noted , the system designer may choose to use memory-mapped I/O even
when a separate I/O address space is available.
In fact, this is the approach recommended by the PCI its plug-and-play capability. A 4-
bit command that accompanies the address identifies which of the three spaces is being
used in a given data transfer operation.
The signaling convention on the PCI bus is similar to the one used, we assumed that the
master maintains the address information on the bus until data transfer is completed.
But, this is not necessary. The address is needed only long enough for the slave to be
selected. The slave can store the address in its internal buffer. Thus, the address is
needed on the bus for one clock cycle only, freeing the address lines to be used for
10
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
sending data in subsequent clock cycles. The result is a significant cost reduction
because the number of wires on a bus is an important cost factor. This approach in used
in the PCI bus.
At any given time, one device is the bus master. It has the right to initiate data
transfers by issuing read and write commands. A master is called an initiator in PCI
terminology. This is either a processor or a DMA controller.
The addressed device that responds to read and write commands is called a target.
Device Configuration
When an I/O device is connected to a computer, several actions are needed to configure
both the device and the software that communicates with it.
The PCI simplifies this process by incorporating in each I/O device interface a small
configuration ROM memory that stores information about that device. The
configuration ROMs of all devices is accessible in the configuration address space.
The PCI initialization software reads these ROMs whenever the system is powered up or
reset. In each case, it determines whether the device is a printer, a keyboard, an
Ethernet interface, or a disk controller. It can further learn bout various device options
and characteristics. Devices are assigned addresses during the initialization process.
This means that during the bus configuration operation, devices cannot be accessed
based on their address, as they have not yet been assigned one. Hence, the
configuration address space uses a different mechanism. Each device has an input signal
called Initialization Device Select, IDSEL#.
The PCI bus has gained great popularity in the PC word. It is also used in many other
computers, such as SUNs, to benefit from the wide range of I/O devices for which a
PCI interface is available.
In the case of some processors, such as the Compaq Alpha, the PCI-processor bridge
circuit is built on the processor chip itself, further simplifying system design and
packaging.
SCSI Bus
11
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
The acronym SCSI stands for Small Computer System Interface. It refers to a
standard bus defined by the American National Standards Institute (ANSI) under the
designation X3.131 .
In the original specifications of the standard, devices such as disks
are connected to a computer via a 50-wire cable, which can be up to 25 meters in length
and can transfer data at rates up to 5 megabytes/s.
The SCSI bus standard has undergone many revisions, and its data transfer capability
has increased very rapidly, almost doubling every two years. SCSI-2 and SCSI-3 have
been defined, and each has several options.
A SCSI bus may have eight data lines, in which case it is called a narrow bus and
transfers data one byte at a time.
Alternatively, a wide SCSI bus has 16 data lines and transfers data 16 bits at a time.
There are also several options for the electrical signaling scheme used.
Devices connected to the SCSI bus are not part of the address space of the
processor in the same way as devices connected to the processor bus.
The SCSI bus is connected to the processor bus through a SCSI controller. This
controller uses DMA to transfer data packets from the main memory to the device, or
vice versa.
A packet may contain a block of data, commands from the processor to the device, or
status information about the device.
Communication with a disk drive differs substantially from communication with the main
memory.
A controller connected to a SCSI bus is one of two types – an initiator or a target.
An initiator has the ability to select a particular target and to send commands
specifying the operations to be performed.
Clearly, the controller on the processor side, such as the SCSI controller, must be able
to operate as an initiator.
The disk controller operates as a target. It carries out the commands it receives from
the initiator.
The initiator establishes a logical connection with the intended target. Once this
connection has been established, it can be suspended and restored as needed to
transfer commands and bursts of data.
While a particular connection is suspended, other device can use the bus to transfer
information.
This ability to overlap data transfer requests is one of the key features of the
SCSI bus that leads to its high performance.
Data transfers on the SCSI bus are always controlled by the target controller. To send
a command to a target, an initiator requests control of the bus and, after winning
arbitration, selects the controller it wants to communicate with and hands control of the
bus over to it.
Then the controller starts a data transfer operation to receive a command
from the initiator.
The processor sends a command to the SCSI controller, which causes the
following sequence of event to take place:
The SCSI controller, acting as an initiator, contends for control of the bus.
When the initiator wins the arbitration process, it selects the target controller and
hands over control of the bus to it.
The target starts an output operation (from initiator to target); in response to this, the
initiator sends a command specifying the required read operation.
12
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
The target, realizing that it first needs to perform a disk seek operation, sends a
message to the initiator indicating that it will temporarily suspend the connection
between them. Then it releases the bus.
The target controller sends a command to the disk drive to move the read head to the
first sector involved in the requested read operation. Then, it reads the data stored in
that sector and stores them in a data buffer. When it is ready to begin transferring
data to the initiator, the target requests control of the bus. After it wins arbitration, it
reselects the initiator controller, thus restoring the suspended connection.
The target transfers the contents of the data buffer to the initiator and then
suspends the connection again. Data are transferred either 8 or 16 bits in parallel,
depending on the width of the bus.
The target controller sends a command to the disk drive to perform another seek
operation. Then, it transfers the contents of the second disk sector to the initiator as
before. At the end of this transfers, the logical connection between the two controllers
is terminated.
As the initiator controller receives the data, it stores them into the main memory using
the DMA approach.
The SCSI controller sends as interrupt to the processor to inform it that the
requested operation has been completed.
This scenario show that the messages exchanged over the SCSI bus are at a higher level
than those exchanged over the processor bus. In this context, a “higher level” means
that the messages refer to operations that may require several steps to complete,
depending on the device. Neither the processor nor the SCSI controller need be aware
of the details of operation of the particular device involved in a data transfer. In the
preceding example, the processor need not be involved in the disk seek operation.
• The target controller sends a command to the disk drive to move the read head
to the first sector involved in the requested read operation.
• The target transfers the contents of the data buffer to the initiator and then
suspends the connection again.
• The target controller sends a command to the disk drive to perform another
seek operation.
• As the initiator controller receives the data, it stores them into the main
memory using the DMA approach.
• The SCSI controller sends an interrupt to the processor to inform it that the
requested operation has been completed.
The bus signals, arbitration, selection, information transfer and reselection are the
topics discussed in addition to the above.
The USB has been designed to meet several key objectives such as:
• Provide a simple, low-cost and easy to use interconnection system that
overcomes the difficulties due to the limited number of I/O ports available on a
computer
13
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
Port Limitation
Here to add new ports, a user must open the computer box to gain access to the
internal expansion bus and install a new interface card. The user may also need to know
how to configure the device and the software. And also it is to make it possible to add
many devices to a computer system at any time, without opening the computer box.
Device Characteristics
The kinds of devices that may be connected to a computer cover a wide range of
functionality - speed, volume and timing constraints. A variety of simple devices
attached to a computer generate data in different asynchronous mode. A signal must be
sampled quickly enough to track its highest-frequency components.
Plug-and-play
USB architecture
The three basic approaches of I/O transfers are discussed. The simplest
technique is programmed I/O, in which the processor performs all the necessary control
functions under direct control of program instructions. The second approach is based on
the use of interrupts. The third I/O scheme involves DMA, the DMA controller
transfers data between an I/O device and the main memory without continuous
processor intervention. Access to memory is shared between the DMAQ controller and
the processor.
Three popular interconnection standards – PCI, SCSI, USB are discussed. They
represent different approaches that meet the needs of various devices and reflect the
increasing importance of plug-and-ply features that increase user convenience.
14
II B.Tech R16 – CO UNIT-IV I/O ORGANIZATION
References:
1. Carl Hamacher, Zvonko Vranesic, Safwat Zaky, Computer Organization, fifth edition,
2. Computer Architecture and Organization, John P. Hayes, 3rd Edition, McGraw Hill.
15