First Steps With Embedded Systems
First Steps With Embedded Systems
First Steps With Embedded Systems
with
Embedded
Systems
02A4 A ox40;
02A6 B gs&0x20)
02A9 C table();
[email protected]
CDSCode Development Systems
The Byte Craft Limited Code Development
Systems are high-performance embedded
[email protected]
http://www.bytecraft.com
Copyright ! 1997, 2002 Byte Craft Limited. Licensed Material. All rights reserved.
First Steps with Embedded Systems is protected by copyrights. All rights reserved. No part of this
publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise without the prior written
permission of Byte Craft Limited.
All example and program code is protected by copyright.
Printed in Canada 14 November, 2002
Table of Contents
1. Introduction 1
1.1 Typographical Conventions....................................................................... 1
1.2 Explaining the Microcontroller ................................................................. 2
1.3 Book Contents ........................................................................................... 3
2. Microcontroller Overview 5
2.1 What is a Microcontroller?......................................................................... 5
2.2 The Microcontroller in a System............................................................... 7
2.3 Architecture ............................................................................................... 7
2.3.1 Von Neumann ......................................................................................................................8
2.3.2 Von Neumann Memory Map.............................................................................................8
2.3.3 Harvard ..................................................................................................................................9
2.3.4 Harvard Memory Map.......................................................................................................10
2.3.5 The Central Processing Unit ............................................................................................11
2.3.6 Central Processing Unit.....................................................................................................13
2.3.7 ROM.....................................................................................................................................14
2.3.8 RAM .....................................................................................................................................15
2.3.9 I/O Ports.............................................................................................................................16
2.3.10 Timer..................................................................................................................................17
2.3.11 Interrupt Circuitry............................................................................................................18
2.3.12 Buses ..................................................................................................................................19
2.4 Sample Microcontroller Configurations.................................................. 19
2.4.1 Motorola MC68HC705C8 ................................................................................................19
2.4.2 National Semiconductor COP8SAA7 ............................................................................20
2.4.3 Microchip PIC16C54.........................................................................................................20
2.4.4 Microchip PIC16C74.........................................................................................................21
i
Table of Contents
ii
Table of Contents
3.10 Devices................................................................................................... 41
3.10.1 Mask ROM........................................................................................................................41
3.10.2 Windowed Parts ...............................................................................................................41
3.10.3 OTP....................................................................................................................................41
4. Programming Fundamentals 43
4.1 What is a Program?.................................................................................. 43
4.2 Number Systems ..................................................................................... 43
4.3 Binary Information.................................................................................. 44
4.4 Memory Addressing ................................................................................ 46
4.5 Machine Language.................................................................................. 46
4.6 Assembly Language ................................................................................ 46
4.6.1 Assembler ............................................................................................................................47
4.7 Instruction Sets........................................................................................ 47
4.8 The Development of Programming Languages ..................................... 48
4.9 Compilers ................................................................................................ 50
4.9.1 The Preprocessor ...............................................................................................................50
4.9.2 The Compiler ......................................................................................................................50
4.9.3 The Linker ...........................................................................................................................50
4.10 Cross Development................................................................................ 51
4.10.1 Cross compiler..................................................................................................................51
4.10.2 Cross development tools ................................................................................................51
4.10.3 Embedded Development Cycle.....................................................................................52
iii
Table of Contents
6. C Program Structure 65
6.1 C Preprocessor Directives........................................................................65
6.2 Identifier Declaration ..............................................................................65
6.2.1 Identifiers in Memory ....................................................................................................... 66
6.2.2 Identifier names ................................................................................................................. 66
6.2.3 Variable Data Identifiers................................................................................................... 67
6.2.4 Constant Data Identifiers ................................................................................................. 67
6.2.5 Function Identifiers........................................................................................................... 68
6.3 Statements ...............................................................................................68
6.3.1 The Semicolon Statement Terminator ........................................................................... 69
6.3.2 Combining Statements in a Block................................................................................... 69
iv
Table of Contents
v
Table of Contents
9. Control Structures 99
9.1 Conditional Expressions..........................................................................99
9.2 Decision Structures................................................................................ 100
9.2.1 if and else Statements...................................................................................................... 100
9.2.2 Nested if statements........................................................................................................ 101
9.2.3 Matching else and if......................................................................................................... 102
9.2.4 switch and case................................................................................................................. 103
9.2.5 Execution within a switch .............................................................................................. 103
9.2.6 Fall-through execution.................................................................................................... 104
9.2.7 The default case ............................................................................................................... 105
9.2.8 The goto Statement ......................................................................................................... 105
9.2.9 Comparing goto and switch..case.................................................................................. 106
9.3 Looping Structures ................................................................................ 106
9.3.1 Control expression .......................................................................................................... 106
9.3.2 The while loop.................................................................................................................. 107
9.3.3 The do loop ...................................................................................................................... 107
9.3.4 The for loop...................................................................................................................... 108
9.3.5 How the for loop works ................................................................................................. 108
9.4 Exiting a Loop....................................................................................... 109
9.4.1 The break Statement ....................................................................................................... 109
9.4.2 The continue Statement.................................................................................................. 109
vi
Table of Contents
vii
Table of Contents
viii
Table of Contents
ix
Table of Contents
x
Table of Examples
Example 1: Defining ports with #pragma directives.............................................................17
Example 2: Using a union structure to create a scratch pad.................................................28
Example 3: Using globally allocated data space in a function ..............................................29
Example 4: A typical assembly language program for the COP8SAA................................49
Example 5: Program in Example 4 compiled for the 68HC705C8 .....................................49
Example 6: A typical microcontroller program ......................................................................55
Example 7: Syntax for the main( ) function ............................................................................59
Example 8: Using the C assignment statement.......................................................................60
Example 9: The if statement syntax..........................................................................................61
Example 10: Nesting if and while statements .........................................................................62
Example 11: Calling one function from another ....................................................................62
Example 12: C functions containing inline assembly language ............................................64
Example 13: Common C keywords..........................................................................................66
Example 14: Using braces to delineate a block.......................................................................70
Example 15: The while loop......................................................................................................70
Example 16: Declaring variable types ......................................................................................72
Example 17: Assigning a character value.................................................................................76
Example 18: Octal, hex and binary notation...........................................................................79
Example 19: Data type modifiers..............................................................................................80
Example 20: Postfix and prefix unary operators ....................................................................85
Example 21: Sample binary operators......................................................................................85
Example 22: Trinary conditional operator...............................................................................86
Example 23: Combining operators in a statement .................................................................86
Example 24: Concatenating expressions with the comma operator....................................87
Example 25: Combining assignment operators in statements..............................................87
Example 26: Addition, subtraction and multiplication operators ........................................88
Example 27: Division and modulus operators........................................................................88
Example 28: Differentiating the division and modulus operators .......................................88
Example 29: Prefix and postfix notation for increment and decrement.............................89
Example 30: Postfix increment and decrement ......................................................................89
Example 31: Using prefix increment and decrement.............................................................90
Example 32: Variations on the assignment statement ...........................................................91
Example 33: Defining constant values for true and false......................................................91
Example 34: Defining constant values for true and false in a portable way.......................92
xi
Table of Examples
xii
Table of Examples
xiii
Table of Examples
xiv
Table of Figures
Figure 1: The microcontroller......................................................................................................7
Figure 2: Von Neumann memory map for the MC68705C8 .................................................9
Figure 3: Harvard memory map PIC16C74 ............................................................................10
Figure 4: Harvard memory map COP8SAA7 .........................................................................11
Figure 5: Instruction clocking on the PIC16C54....................................................................12
Figure 6: The CPU ......................................................................................................................13
Figure 7: MC68HC705C8 stack ................................................................................................15
Figure 8: Saving the machine state on the MC68HC705C8 .................................................33
Figure 9: Data storage VS. data value.......................................................................................45
Figure 10: RS-232 signal ...........................................................................................................170
Figure 11: Project schematic ....................................................................................................182
xv
Table of Tables
Table 1: Hardware characteristics of the Motorola MC68HC705C8 ..................................20
Table 2: Hardware characteristics of the National Semiconductor COP8SAA7...............20
Table 3: Hardware characteristics of the Microchip PIC16C54...........................................21
Table 4: Hardware characteristics of the Microchip PIC16C74...........................................21
Table 5: Sample vectored interrupts .........................................................................................32
Table 6: Binary, decimal and hexadecimal ...............................................................................44
Table 7: Interpretation of assembly language..........................................................................47
Table 8: Instruction set comparisons........................................................................................48
Table 9: Pointers and pointers-to-pointers ............................................................................122
Table 10: PC serial port addresses and interrupts.................................................................171
Table 11: UART chips ..............................................................................................................172
Table 12: COM port registers..................................................................................................173
Table 13: Interrupt enable register bits ..................................................................................174
Table 14: Interrupt identification register ..............................................................................175
Table 15: FIFO control register ..............................................................................................175
Table 16: Line Control Register ..............................................................................................176
Table 17: Modem Control Register ........................................................................................176
Table 18: Line Status Register..................................................................................................177
Table 19: Modem Status Register............................................................................................177
Table 20: Pin outs on the RS232 port ....................................................................................183
Table 21: Rules of operator precedence.................................................................................185
Table 22: ASCII characters ......................................................................................................187
xvii
Acknowledgements
This book represents the hard work of many people at Byte Craft Limited. We
want to offer as much of our experience as possible to those entering the
Embedded Systems field. We are leveraging our experience in embedded
systems, in technical communication, and in publishing to bring about
informative publications that do just that.
Kirk Zurell edited this publication and designed the cover art.
xix
1. Introduction
This book is intended to fill the need for an intermediate level overview of
programming microcontrollers using the C programming language. It is aimed
specifically at two groups of readers who have different, yet overlapping needs.
! The first group are familiar with C but require an examination of the general
nature of microcontrollers: what they are, how they behave and how best to use
the C language to program them.
" The second group are familiar with microcontrollers but are new to the C
programming language and wish to use C for microcontroller development
projects.
First Steps with Embedded Systems will be useful both as an introduction to
microcontroller programming for intermediate level post-secondary programs
and as a guide for developers coping with the growth and change of the
microcontroller industry.
NOTE
An important note will appear in this way.
each line of code written in C can replace many lines of assembly language.
Debugging and maintaining code written in C is much easier than in assembly
language code.
3
Introduction
4
2. Microcontroller Overview
This section provides a brief overview of general microcontroller features and
resources. It is designed to familiarise you with microcontroller terminology and
basic microcontroller architecture. Many of the concepts introduced in this
section will be revisited throughout the book.
NOTE
A microcontroller is not the same as a microprocessor. A microprocessor is a
single chip CPU used within other computer systems. A microcontroller is itself a
single chip computer system.
5
Microcontroller Overview
6
The Microcontroller in a System
2.3 Architecture
There are two basic types of architecture: Harvard and Von Neumann.
Microcontrollers most often use a Harvard or a modified Harvard-based
architecture.
7
Microcontroller Overview
Von Neumann architecture has a single, common memory space where both
program instructions and data are stored. There is a single data bus which
fetches both instructions and data. Each time the CPU fetches a program
instruction it may have to perform one or more read/write operations to data
memory space. It must wait until these subsequent operations are complete
before it can fetch and decode the next program instruction. The advantage to
this architecture lies in its simplicity and economy.
NOTE
On some Von Neumann machines the program can read from and write to CPU
registers, including the program counter. This can be dangerous as you can point
the PC at memory blocks outside program memory space. Careless PC
manipulation can cause errors which require a hard reset.
Every microcontroller has a very specific layout for its memory. Usually this is
depicted with the help of a memory map. A memory map is a diagram which
shows how the microcontroller memory is used. The following example map is
from the Motorola MC68HC705C8 microcontroller configured for 176 bytes of
RAM and 7744 bytes of PROM:
8
Architecture
Contents Address
I/O 32 bytes 0x0000
0x001F
User Prom 48 bytes 0x0020
0x004F
176 Bytes of RAM 0x0050
0x00BF
STACK 0x00C0
0x00FF
User PROM 96 bytes 0x0100
0x015F
User PROM 7584 bytes 0x0160
0x1EFF
Boot ROM 223 bytes 0x1F00
0x1FDE
Option Register 0x1FDF
Boot ROM vectors 16 bytes 0x1FE0
0x1FEF
Unused 4 bytes 0x1FF3
User PROM vectors 12 bytes
0x1FF4
0x1FFF
Figure 2: Von Neumann memory map for the MC68705C8
2.3.3 Harvard
9
Microcontroller Overview
Typically these chips have a register analogous to the program counter (PC)
which refers to addresses in program space. Also, some chips support the use
of any 16 bit value contained in data space as a pointer into the program
address space. These chips have special instructions to use these data pointers.
NOTE
It is important that you understand how your Harvard architecture part deals with
data in program space. It is possible to generate more efficient code using symbolic
constants declared with #define directives instead of declared constants. You may
also create global variables for constant values.
The following memory map is from the Microchip PIC16C74. Notice that
program memory is paged and data memory is banked. The stack is
implemented in hardware and the developer has no access to it.
Program Memory (4K) Data Memory (256 bytes)
Bank 0 Bank 1
Reset Vector 0x0000 Program 0x00 0x80
Counter Specific Specific
Registers Registers
Interrupt Vector 0x0004 8
Program Memory 0x0005 Level
Page 0 0x07FF Stack
Program Memory 0x0800
Page 1 0x0FFF 0x1F 0x9F
Unimplemented 0x1000 0x20 General General 0xA0
Purpose Purpose
0x1FFF 0x7F Register Register 0xFF
Figure 3: Harvard memory map PIC16C74
The following is the memory map for the COP8SAA7. The stack grows down
from the top of general purpose RAM.
10
Architecture
0x2F
Unused RAM 0x30
0x7F
0x80
Interrupt Vector 0x0FF Specific
Registers
0xFE
0x400 Segment Register 0xFF
Figure 4: Harvard memory map COP8SAA7
The central processing unit (CPU) does all the computing: it fetches, decodes
and executes program instructions and directs the flow of data to and from
memory. The CPU performs the calculations required by program instructions
and places the results of these calculations, if required, into memory space.
Most CPUs are synchronous. This means that they depend on the cycles of a
processor clock. A clock generates a high-frequency square wave usually
driven by a crystal, a RC (resistor capacitor) or an external source. The clock is
sometimes referred to as an oscillator. The clock speed, or oscillation rate, is
measured in megahertz (MHz) which represents one million cycles/second. For
example, if the clock speed is 3 MHz then there are 3,000,000 clock
cycles/second.
Clock configurations are microcontroller dependant. The following are some
sample clock configurations:
"# The National Semiconductor COP8SAA7 has four clock options: crystal
with bias resistor, crystal without bias resistor, R/C, and external. The
option is selected with bits 3 and 4 of the ECON register. The CK1 and
CK0 pins are used for clock related input and output.
"# The Motorola MC68HC705C8 has two pins, OSC1 and OSC2, which
provide connections for an on-chip oscillator. A crystal, ceramic resonator,
or external signal can be attached to the pins. The oscillator frequency is
11
Microcontroller Overview
two times the internal bus rate and the processor clock cycle is two times
the oscillator frequency.
"# The Microchip PIC16C54 has clock input pin OSC1/CLKIN and clock
output pin OSC2/CLKOUT. OSC1/CLKIN is internally divided by four
to generate four clocks. There are four possible modes: low power crystal,
crystal/resonator, high speed crystal, resistor/capacitor.
The clock controls the sequence of instructions. Most microcontrollers divide
their basic clock frequency to arrive at a bus-rate clock. Each instruction takes a
specific number of bus-rate clock cycles in order to execute. The following
depicts the clocking scheme for the Harvard architecture Microchip PIC16C54:
Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4
OSC
1
Q1
Q2
Q3
Q4
PC PC PC + 1 PC + 2
OSC
2
Fetch Instruction (PC)
Execute Instruction (PC-1) Fetch Instruction (PC+1)
Execute Instruction (PC) Fetch Instruction (PC+2)
Execute Instruction (PC+1)
Figure 5: Instruction clocking on the PIC16C54
12
Architecture
One part of the CPU is responsible for performing calculations and executing
instructions. This part is called the arithmetic logic unit, or ALU. There are a
variety of subsidiary components which support the ALU. These components
include the decoder, the sequencer and a variety of registers.
The decoder converts instructions stored in program memory into codes
which the ALU can understand. The sequencer manages the flow of data
along the microcontroller’s data bus. Registers are used by the CPU to
temporarily store vital data which are volatile: they can change during program
execution. Most microcontroller registers are memory-mapped, associated with
a memory location, and can be used like any other memory location.
13
Microcontroller Overview
Registers store the state of the CPU. If the contents of microcontroller memory
and the contents of these registers are saved it is possible to suspend program
operation for an indefinite period of time and restart exactly in the state when
the program was suspended.
The number and names of registers varies drastically among microcontrollers.
However there are certain registers which are common to most
microcontrollers, although the names may vary. These include:
"# The stack pointer
The stack pointer contains the address of the next location on the stack.
The address in the stack pointer is decremented when data is pushed on the
stack and incremented when data is popped from the stack.
"# The index register
The index register is used to specify an address when certain addressing
modes are used. It is also known as the pointer register. The Microchip
devices use the name FSR (file select register).
"# The program counter
Perhaps the single most important CPU register is the program counter
(PC). The PC holds the address of the next instruction in program memory
space. It contains the address of the next instruction the CPU will process.
As each instruction is fetched and processed by the ALU, the CPU
increments the PC and thereby steps through the program stored in the
program memory space.
"# The accumulator
The accumulator is a register that can hold operands or results of
operations as necessary. The Microchip devices use the name W (working)
register.
Other registers may reflect results from the instruction just executed, control
the options available on the device, and enable access to certain areas of
memory.
2.3.7 ROM
14
Architecture
2.3.8 RAM
RAM, random access memory, is used to write and read data values as a
program runs. RAM is volatile: if you remove the power supply its contents are
lost. Any variables used in a program are allocated from RAM.
The time to retrieve information from RAM does not depend upon the location
of the information because RAM is not sequential, hence the term random
access.
Most small microcontrollers provide very little RAM which forces you to write
applications that use RAM wisely. Manipulating large data structures and using
pointers, re-entrant or recursive functions use large amounts of RAM and are
techniques which are generally avoided on microcontrollers.
Some C instructions which are rarely used on larger platforms are more
commonly used in C programs for microcontrollers. One example is the goto
instruction reviled by traditional C programmers. While goto is rarely used on
larger platforms, in embedded system programming it can sometimes be used
to save RAM.
If your hardware supports a stack, the stack contents and the space required to
manage the stack are usually allocated from RAM. A stack is a structure which
records the chronological ordering of information. It is used mainly in
subroutine calls and interrupt servicing. A stack is a LIFO (last in, first out)
structure. The following stack is from the Motorola MC68HC705C8. The stack
is 64 bytes from address 00C0 to 00FF:
0x00C0
Stack pointer
bit number 5 4 3 2 1 0
0x00FF 0 0 0 0 0 1 1 address
Figure 7: MC68HC705C8 stack
The stack pointer contains the address of the next free location on the stack.
On reset the stack pointer for the MC68HC705C8 holds the value 00FF. The
stack pointer is decremented when data is pushed on the stack and incremented
when data is popped from the stack.
15
Microcontroller Overview
There are two main port types, parallel and serial, and two port modes,
synchronous and asynchronous. Parallel I/O requires a data line for each bit,
while serial I/O uses a single line and transfers bits in sequence. Synchronous
I/O is synchronised to a clock while asynchronous I/O is not. Microcontrollers
most often have parallel I/O capability built in and serial I/O as a peripheral
feature.
The following are some sample port configurations:
"# The COP8SAA7 has four bidirectional 8 bit I/O ports called C, G, L and F
where each bit can be either input, output or tristate. Each port has an
associated configuration register and data register. It also has a
MICROWIRE/PLUS synchronous serial interface
"# The Motorola MC68HC705C8 has 3 8 bit ports called A, B, and C which
can be either inputs or outputs depending on the value of the data direction
register (DDR). There is also a 7 bit fixed input port called port D which is
used for serial port programming. This device also has a SCI (serial
communications interface) asynchronous serial interface and a SPI (serial
peripheral interface) which both use Port D for their functions.
"# The Microchip PIC16C74 has five ports: PORTA through PORTE. Each
port has an associated TRIS register which controls the direction. PORTA
uses the register ADCON1 to select analog or digital configuration.
PORTD and PORTE can be configured as an 8 bit parallel slave port. The
PIC16C74 has a SSP (synchronous serial port) module which can operate
both in SPI and I2C modes. The device also has a SCI module
Serial ports have a frequency of operation called their baud rate. The baud rate
is the reciprocal of the transmission time for each bit. For example, if the baud
rate is 9600 bits/second then the transmission time for each bit is 19600 of a
second.
While microcontrollers do not support the same sophisticated input/output
functions as larger platforms, such as those in the C stdio library, they still
support device I/O. The input/output channels allow the microcontroller to
communicate with such peripheral devices as timers, sensors, keypads and LCD
screens.
Microcontroller ports are usually memory-mapped and can therefore be used
like any other memory location. Ports usually consist of 8 or fewer bits which
16
Architecture
often support tristate logic with three states: input, output or high
impedance. High impedance mode is the state of being undefined or floating.
Some devices only support binary logic and in those cases the bit can be
defined as a combination of only two of the three states. If a port has
programmable input and output it will also have an associated register which
specifies whether the data is input or output. On many devices this register is
called the DDR (data direction register).
To reserve memory-mapped port locations so your compiler does not use them
for data memory allocation, you can use a #pragma preprocessor directive to
specify the location of each mapped I/O register. This also allows you to
provide a useful mnemonic name for each I/O port. You can then use the
variable name associated with the port to read or write to a particular I/O port.
The following defines two ports and their associated direction registers on the
Motorola 68HC705C8:
#pragma portrw PORTA @ 0x0000;
#pragma portrw PORTB @ 0x0001;
#pragma portrw DDRA @ 0x0004;
#pragma portrw DDRB @ 0x0005;
Example 1: Defining ports with #pragma directives
It is then possible to write the value AC to the port using the C command:
DDRA=0xFF; //set the direction to output
PORTA=0xAC; //set the port to the value AC
2.3.10 Timer
A timer is a counter that is incremented at a fixed rate when the system clock
pulses. There are several different types of timers available. A timer/counter
can perform several different tasks. The CPU uses the timer to keep track of
time accurately. The timer can generate a stream of pulses or a single pulse at
different frequencies. It can be used to start and stop tasks at desired times.
A COP (computer operating properly) or watchdog timer checks for runaway
code execution. The hardware implementation of watchdog timers varies
considerably between different processors. In general watchdog timers must be
turned on once within the first few cycles after reset and then reset periodically
with software. Some watchdog timers can programmed for different time-out
delays. The reset sequence is sometimes as simple as a specialized instruction or
as complex as sending a sequence of bytes to a port. Watchdog timers either
reset the processor or execute an interrupt when they time out.
17
Microcontroller Overview
18
Sample Microcontroller Configurations
2.3.12 Buses
A bus carries information in the form of signals. There are three main buses:
address, data, and control.
1) The address bus is unidirectional and carries the addresses of memory
locations indicating where the data is stored. The number of wires in the
address bus determines the total number of memory locations. With a 13
13
bit address bus, for example, there would be 2 or 8192 memory locations.
2) The data bus is bi-directional and carries information between the CPU
and memory or I/O devices. Computers are often classified according to
the size of their data bus. The term “8-bit microcontroller” refers to a
microcontroller with 8 lines on its data bus. The number of wires in the
data bus determines the number of bits that can be stored in each memory
location.
3) The control bus carries data which controls system activity. Often this data
includes timing signals which synchronize the movement of other
information.
19
Microcontroller Overview
Pins 40 or 44 pins
Clock 4MHz On-chip oscillator with crystal/ceramic resonator
RAM 176 bytes default (options include 208, 272 and 304)
ROM 7744 bytes default (options include 7696, 7648 and 7600)
Voltage 3.0 to 5.5 Volt
Registers Accumulator, Index, Program Counter, Stack pointer, Condition Code
Register
Timer(s) COP, 16 bit programmable timer
Ports 4: 8 bit I/O ports PORTA, PORTB and PORTC, 7 bit input PORTD
Interrupts 5 interrupts: IRQ pin, SWI, SPI, SCI and timer
Serial SPI (serial peripheral interface), SCI (serial communications interface)
Options Clock monitor
Table 1: Hardware characteristics of the Motorola MC68HC705C8
The PIC16C54 is a member of the PIC16C5x family. These are 8 bit, EPROM-
based CMOS microcontrollers. The PIC16C54 is Harvard architecture.
20
Sample Microcontroller Configurations
Pins 18 pins
Clock 20 MHz user selectable from low power crystal, crystal, high speed
crystal, R/C
RAM 32 bytes
ROM 512 words (12 bits)
Voltage 2.5 V to 6.25 V
Registers status, option, INDF, FSR, program counter, Working
Timer(s) watchdog, 8 bit timer, reset timer
Ports 4 bit I/O Port A, 8 bit I/O Port B
Interrupts 1
Serial none
Options none
Table 3: Hardware characteristics of the Microchip PIC16C54
The PIC16C74 is a member of the PIC16C7x family. These are 8-bit, EPROM-
based CMOS microcontrollers. The PIC16C74 is Harvard architecture.
Pins 40/44
Clock 20 MHz
RAM 192
ROM 4K
Voltage 3 to 6
Registers 48 including Status, Option, Intcon, PIE1, PIR1, PIE2, PIR2, PCON, PCL,
PCLATH,INDF, FSR
Timer(s) 2 8 bit, 16 bit, watchdog
Ports 6 bit PORTA, 8 bit PORTB, PORTC, parallel PORTD, 3 bit PORTE also
TRISA, TRISB, TRISC, TRISD, and TRISE
Interrupts 12
Serial SPI, I2C, SSP, SCI
Options A/D Converter
Table 4: Hardware characteristics of the Microchip PIC16C74
21
3. The Embedded Environment
Microcontrollers used in development projects have very limited resources. You
are working close to your target machine and you must be familiar with your
target hardware construction and operation.
A good quality C development environment incorporates tools which allow you
to concentrate primarily on your applications and not on the hardware which
runs them. However, you cannot ignore low-level details of your target
hardware. The better you understand your run-time environment, the better
you can take advantage of its limited capabilities and resources.
Reliability
Embedded systems must be reliable. Personal computer programs such as word
processors and games do not need to achieve the same standard of reliability
that a microcontroller application must. Errors in programs such as word
processors may result in errors in a document or loss of data. An error in a
microcontroller application such as a television remote control or compact disc
player will result in a product that does not work and consequently does not
sell. An error in a microcontroller application such as an antilock braking
system or autopilot could be fatal.
Efficiency
Issues of efficiency must be considered in real time applications. A real time
application is one in which must be able to act at a speed corresponding with
the occurrence of an actual process.
Cost
Many embedded systems must compete in a consumer market and cost is an
important issue in project development.
23
The Embedded Environment
CMOS
Complementary Metal Oxide Semiconductor (CMOS) is a technique commonly
used to fabricate microcontrollers. CMOS requires less power and CMOS chips
can be static which allows the implementation of a sleep mode. CMOS
microcontrollers must have all inputs connected to something.
PMP
Post Metal Programming (PMP) allows ROM to be programmed after final
metalization. This allows ROM to be programmed very late in the productions
cycle.
3.3.1 RAM
Random access memory1 or RAM consists of memory addresses the CPU can
both read from and write to. RAM is used for data memory and allows the CPU
to create and modify data as it executes the application program.
RAM is volatile, it holds its contents only as long as it has a constant power
supply. If power to the chip is turned off, the contents of RAM are lost. This
does not mean that RAM contents are lost during a chip reset. Vital state
information or other data can be recorded in data memory and recovered after
an interrupt or reset.
1random access memory is used because the CPU can access any block of memory in
RAM in the same amount of time. This differs from sequential storage such as tape
where access time differs for different parts of the storage space.
24
Memory Addressing and Types
Some chips provide an alternate RAM power supply so that memory contents
can be maintained even when the rest of the chip is without power. This does
not make RAM any less volatile, without a backup power source the contents
would still be lost. This type of RAM is called battery backed-up static RAM.
3.3.2 ROM
ROM, read only memory, is typically used for program instructions. The ROM
in a microcontroller usually holds the final application program.
Maskable ROM is memory space that must be burned in by the manufacturer
of the chip as it is constructed. To do this, you must provide the chip builder
with the ROM contents you wish the chip to have. The manufacturer will then
mask out appropriate ROM blocks and hardwire the information you have
provided.
Since recording chip ROM contents is part of the manufacturing process, it is a
costly one-time expense. If you intend to use a small number of parts, you may
be better off using chips with PROM. If you intend to use a large number of
parts for your application, then the one-time expense of placing your program
in ROM is more feasible.
3.3.3 PROM
3.3.4 EPROM
EPROM (erasable programmable ROM) is not volatile and is read only. Chips
with EPROM have a quartz window on the chip. Direct exposure to ultra-violet
25
The Embedded Environment
radiation will erase the EPROM contents. EPROM devices typically ship with a
shutter to cover the quartz window and prevent ambient UV from affecting the
memory. Often the shutter is a sticker placed on the window.
Developers use an EPROM eraser to erase memory contents efficiently. The
eraser bombards the memory with high-intensity UV light. To reprogram the
chip, an EPROM programmer is used, a device which writes instructions into
EPROM.
The default, blank state for an EPROM device has each block of memory set.
When you erase an EPROM you are really setting all memory blocks to 1.
Reprogramming the device resets or clears the appropriate EPROM bits to 0.
Because of the way EPROM storage is erased, you can not selectively delete
portions of EPROM – when you erase the memory you must clear the entire
storage space.
3.3.5 EEPROM
26
Memory Addressing and Types
3.3.7 Registers
27
The Embedded Environment
28
Interrupts
To use the global variable as a loop counter within a function, the following
code could be used:
int somefunc() {
ScratchPad.asShort=0;
while (ScratchPad.asShort < 10) {
// some code
ScratchPad.asShort += 1;
} // end while
return (someIntValue);
}
Example 3: Using globally allocated data space in a function
3.4 Interrupts
2The @ symbol uses the address allocated to __longIX for the new symbol myTemp.
This is not standard C so the syntax your compiler provides may be different.
29
The Embedded Environment
NOTE
It is vital that you understand how your target hardware implements interrupts as
this affects both the service routines you must write and how you write them.
In general, you must write an interrupt service routine for each interrupt your
target hardware can detect even if the handler consists solely of a return from
interrupt or a similar instruction.
Interrupts are asynchronous: they are events that can occur during, after, or
before an instruction cycle. Interrupt acknowledgement can be either
synchronous or asynchronous. Most interrupt acknowledgement is synchronous, the
instruction currently being executed is completed before the interrupt is
acknowledged.
Theoretically, when the processor acknowledges an interrupt asynchronously it
halts execution of the current instruction and immediately services the interrupt.
The only asynchronously acknowledged interrupt is RESET. Since RESET
erases the state of the machine, it is a moot point whether the CPU actually
halts execution of the current instruction or not.
When the processor acknowledges an interrupt synchronously, it finishes
executing the current instruction and, before it performs a fetch for the next
instruction, it services the interrupt.
30
Interrupts
There are two general ways in which microcontrollers service interrupts, each
with several variations.
31
The Embedded Environment
On most chips, the interrupt process saves the state of the machine including
the current program counter, stack pointer, and register contents. This is done
to ensure that after an interrupt is serviced execution will resume at the
appropriate point in main program with no loss of data.
32
Interrupts
Some chips save the machine state automatically while others will only save a
portion of the machine state. In the second case it is up to the programmer to
provide code which saves the current state. Usually each interrupt handler will
do this before attempting anything else. The location and accessibility of the
saved state information varies from machine to machine. In most cases, it is
saved on a stack located in data memory.
For example the Motorola MC68HC705C8 saves the machine state in the stack
as follows:
1 1 1 Condition Codes
Accumulator
Index Register
0 0 0 PC High
33
The Embedded Environment
MC68HC705C8 the interrupt mask bit of the Condition Code Register is set to
prevent interrupts.
Some machines provide a small number of non-maskable interrupts (NMI).
Interrupts that can be disabled are maskable, those which you cannot disable are
non-maskable. For example, RESET is non-maskable – regardless of the code
currently executing the CPU must service a RESET interrupt. Some
microcontrollers also designate software interrupts or BREAK instructions that
you can use as a non-maskable interrupt.
What happens after the CPU services an interrupt? This varies depending upon
target hardware. In general, the CPU first checks for any outstanding interrupts.
One some machines the CPU first fetches an instruction and then checks for
interrupts after executing this instruction. This guarantees that no matter how
many interrupts cue up, the machine will always step through program code and
no more than one interrupt handler will execute between each main program
instruction.
On most machines the CPU will check for interrupts before performing the
next instruction fetch. As long as it detects a pending interrupt it will service the
interrupt before fetching the next instruction. This means it is possible to halt a
program by continuously sending interrupts. On the other hand, it guarantees
that an interrupt is serviced before any more main program code is executed.
When an interrupt occurs the signal sets a register bit. When the CPU checks
for pending interrupts it reads the register for set bits. Upon completing an
interrupt handler, the appropriate bit in the register is cleared.
How does the CPU decide which interrupt to service first? Each interrupt a
chip can detect has a precedence, the chip services those interrupts with a
higher precedence first.
Microcontrollers vary widely in the types of interrupts they can detect. Some
general types are widely available in one form or another. The only universal
interrupt is RESET and some simple chips support no other interrupts.
34
Specific Interrupts
3.5.1 RESET
The RESET interrupt prompts the chip to behave as if the power has been
cycled. It does not actually cycle the power to the chip. This means that the
contents of volatile memory, typically data memory, can remain intact. The
reset vector contains the address of the first instruction that will be executed
by the CPU.
You can write an initialization routine to be executed before any other program
code which first checks specific locations in data memory for particular values
and then loads values into those locations. This can be used to check if the
RESET was cold, power cycled, or warm, power not cycled. Some compilers
support a initialization function which is executed upon RESET before the
main program.
On most chips, RESET causes the CPU to halt execution immediately and
restart itself. On some chips, RESET may finish the current instruction. Each
microcontroller performs a series of actions when it detects a RESET. For
example, when a RESET occurs on the Motorola MC65HC705C8 the
following actions occur:
1) Data direction registers are cleared
2) Stack pointer is set to 0x00FF
3) CCR I bit is set
4) External interrupt latch is cleared
5) STOP latch is cleared
6) WAIT latch is cleared
A RESET can occur because of a manual reset, a COP time out, low voltage,
initial power on, or an attempt to execute an instruction from an illegal address.
Some chips that support interrupts provide an instruction in the instruction set
which the programmer can use to halt program execution. This instruction
name is different for different devices.
The COP8SAA7 has a Software trap which occurs when the INTR instruction
is placed in the instruction register. The software trap is used for unusual and
35
The Embedded Environment
3.5.3 IRQ
IRQ interrupts are physical pins or ports on the chip which generate an
interrupt when they are sent a signal. Some chips do not support IRQ type
interrupts and those that do implement them in many different ways. The
number of pins available for IRQs varies widely from chip to chip.
The developer usually has the ability to configure the IRQ interrupts to detect
signals in specific ways. For example, they can be made sensitive to a signal
edge, a signal hold, or a signal fall.
For example, the Microchip PIC17C42a has an INT external interrupt pin. The
developer can set the interrupt trigger to be either the rising edge or falling edge
by setting an appropriate register bit. The INT interrupt can be disabled by
clearing the appropriate control bit.
3.5.4 TIMER
code in your program to touch the watchdog at regular intervals before the time
period expires. If your program leaves the watchdog too long without attention,
the configured time period passes with no touch instruction, the watchdog
activates the RESET interrupt.
This type of timer interrupt provides your program with an independent safety
net. Since the watchdog timer depends only upon the clock signals to do its job,
if your program ever fails the watchdog will realize that the computer is not
operating properly and will activate a RESET.
3.6 Power
Most microcontrollers support 4.5 to 5.5 Volt operation. There are also many
low voltage parts which are designed to work at 3 volts or less.
3.6.1 Brownout
3.6.2 Halt/Idle
Input and output are lines or devices which carry information between the
microcontroller and the outside world.
3.7.1 Ports
37
The Embedded Environment
input/output. Often the port state is set with a direction register which
determines if the port is input, output or input/output. When a port pin is an
output it is a latched output. This means that when the pin is in a given state,
set or unset, it will remain in that state until reset.
Microcontrollers usually contain several ports.
For example, the Microchip PIC16C74 has five ports called PORTA, PORTB,
PORTC, PORTD and PORTE. PORTA is 6 bit latch which is configured as
input or output using the register TRISA. PORTA can also be configured as
analog or digital using the ADCON1 register. PORTB is an 8 bit bi-directional
port with data direction register TRISB.
The National Semiconductor COPSAA7 contains four bi-directional ports:
PORTC, PORTG, PORTL and PORTF. Each bit can be configured as input,
output or trisate.
CAN
Controlled Area Network was developed by Bosh and Intel. It is a multiplexed
wiring scheme.
J1850
J1850 is the SAE (Society of Automotive Engineers) standard.
38
Input and Output
UART
A Universal Asynchronous Receiver Transmitter is a serial port adapter that
receives and transmits serial data with each data character preceded by a start bit
39
The Embedded Environment
and followed by a stop bit. There is sometimes a parity bit included. A UART is
used mainly as a serial to parallel and parallel to serial converter.
USART
A Universal Synchronous/Asynchronous Receiver Transmitter is a serial port
adapter used for synchronous or asynchronous serial communication.
Flash converter
Examines each level and decides what level the voltage is at. It is very fast, but
draws a great deal of current and is not feasible beyond 10 bits.
40
Miscellaneous
3.9 Miscellaneous
The clock monitor watches the clock and determines if it is running too slow. It
can activate a microcontroller reset.
3.10 Devices
ROM whose contents are set by masking during the manufacturing process.
3.10.3 OTP
41
4. Programming Fundamentals
It is necessary to understand some basic computer programming concepts
before learning C programming.
The most important thing to remember about computers is that they can do
only what they are instructed to do. To accomplish a meaningful task on a
computer, someone must give it exhaustive and very explicit instructions. A
collection of such instructions is a called a program and the person who writes
and revises these instructions is known as a programmer or developer.
There are several different number systems. We are used to the decimal number
system which is of base 10. This means that it has ten digits and coefficients are
2 1
multiplied by powers of 10. For example 456 is the same as 4(10 ) + 5(10 ) +
0
6(10 ) = 400 + 50 + 6 = 456.
Computers use the binary number system with base 2: it has two digits (0 and 1)
and the coefficients are multiplied by powers of 2. For example 110 is the same
2 1 0
as 1(2 ) + 1(2 ) + 0(2 ) = 6.
The hexadecimal number system is often used as it is easier to read than binary
numbers. It is base 16 and uses 0-9 and A-F to represent values.
43
Programming Fundamentals
When people read and write they use extremely powerful and flexible coding
systems called alphabets. Computers, however, can only handle information
written in the most simple coding system possible — binary notation. The
binary alphabet has only two components: 1 and 0.
44
Binary Information
Signal
Figure 9: Data storage VS. data value
In the example the data is stored at the address 00 Hex. The data stored at that
location has the value 1101 1101 binary or DD hexadecimal.
One bit in computer memory can record either 0 or 1 because it contains a
single binary digit which can exist in only two states. Two bits read in sequence,
however, can record four possible numbers: 0, 1, 2, and 3 because two bits can
exist in the states 00, 01, 10 and 11. As with decimal notation, the first digit
records the multiples of one included in the number. The second digit records
the multiples of two since the computer only has two digits available. Adding a
third digit allows for the encoding of multiples of 4.
Bits are often grouped in sets. 8 bits make 1 byte, while 16 bits make one word.
10
Standard terminology refers to 2 (1024) bytes as a kilobyte.
A programmer can give the computer information and instructions using long
strings of 1’s and 0’s. However, this process would be very time consuming and
prone to error. To resolve these problems programmers have developed
languages in which to write programs. Languages help the programmer by
making the job of programming a computer faster, more efficient, and more
reliable.
45
Programming Fundamentals
Computer memory is divided up into addresses. Each address holds an 8 bit (or
1 byte) value. The number of address lines determines the number of locations
available. For example, the MC68HC05C8 can address 8192 bytes of memory.
13
Sine each bit can hold one of two values (0 or 1) and 2 = 8192 we know that
there are 13 address lines. The first address will be the value 0 0000 0000 0000
(0x0000) and the last address will be the value 1 1111 1111 1111 (0x1FFF).
Microcontrollers have different addressing modes which allow them to access
locations in memory as quickly as possible. For example, the first 256 locations
on the Motorola MC68HC705C8 can be accessed using direct addressing mode
where the CPU assumes that the high byte of the address is 00000000.
46
Instruction Sets
8 bit microcontrollers usually use byte size instruction codes. Each instruction
has two possible components: an opcode and an operand. The opcode is the
function that the instruction performs while the operand is data used by the
opcode. Neither opcodes nor operands are restricted to 1 byte.
There are several different types of addressing modes. An addressing mode is
simply the means by which an address is determined. Some common modes are
immediate data, direct address, and indirect or indexed address.
4.6.1 Assembler
47
Programming Fundamentals
The processor can execute these instructions at a very high speed. RISC uses a
technique called pipelining the processing of instructions can be overlapped.
For example, one instruction can be read from memory while another is
decoded and another is executed. Many RISC machines have a single
instruction size and a small number of addressing modes.
Some microcontrollers are called SISC (Specific Instruction Set Computer)
machines. This is based on the fact that the instruction sets are designed
specifically for control purposes.
Instructions Instruction Cycle Address Modes
Size
MC68HC705C8 58 8, 16 or 24 bit 2 to 11 10
COP8SAA7 56 8, 16 or 24 bit 1 to 5 10
PIC16C54 33 12 bit 1 or 2 3
PIC16C74 35 14 bit 1 or 2 3
Table 8: Instruction set comparisons
48
The Development of Programming Languages
One of the most important tools that programmers developed to deal with new
high level languages is the language compiler.
49
Programming Fundamentals
4.9 Compilers
When programs were written in the past often the development computer was
not powerful enough to hold the entire program being developed in memory at
one time. Historically, programs had to be divided into separate modules where
each module would be compiled into object code and a linker would link the
50
Cross Development
object modules together. Our development machines today are very powerful
and the use of a linker is no longer absolutely necessary.
Many implementations of C provide function libraries which have been pre-
compiled for a particular computer. These functions serve common program
needs such as serial port support, input/output, and description of the
destination computer. Functions within libraries are usually either linked with
modules which use them or included directly by the compiler if the compiler
supports library function inclusion.
When your program has been pre-processed, compiled and linked, the
destination computer will be able to read and execute your program.
A cross compiler runs on one type of computer and produces machine code for
a different type of computer. While many 8 bit embedded microcontrollers can
support sophisticated and extremely useful programs, they are not powerful
enough to support the resource needs of a C development environment. How
does a developer create and compile programs for an 8 bit microcontroller? By
using a cross compiler.
51
Programming Fundamentals
programmer to imprint the translated program into the memory of the 8 bit
microcontroller.
Simulator
A simulator is a software program which allows a developer to run a program
designed for one type of machine (the target machine) on another (the
development machine). The simulator simulates the running conditions of the
target machine on the development machine.
Using a simulator you can step through your code while the program is running.
You can change parts of your code in order to experiment with different
solutions to a programming problem. Simulators do not support real interrupts
or devices.
An in-circuit simulator includes a hardware device which can be connected to
your development system to behave like the target microcontroller. The in-
circuit simulator has all the functionality of the software simulator while also
supporting the emulation of the microcontroller’s I/O capabilities.
Emulator
An emulator or in-circuit emulator is a hardware device which behaves like a
target machine. It is often called a real time tool because it can react to events as
the target microcontroller would. Emulators are often packaged with monitor
programs which allow developers to examine registers and memory locations
and set breakpoints.
52
Cross Development
Problem Specification
The problem specification is a statement of the problem that your program will
solve without considering any possible solutions. The main consideration is
explaining in detail what the program will do.
Once the specification of the problem is complete you must examine the
system as a whole. At this point you will consider specific needs such as those
of interrupt driven or timing-critical systems.
For example, if the problem is to design a home thermostat the problem
specification should examine the functions needed for the thermostat. These
function may include reading the temperature, displaying the temperature,
turning on the heater, turning on the air conditioner, reading the time, and
displaying the time. Based on these functions it is apparent that the thermostat
will require hardware to sense temperature, a keypad, and a display.
Tool/Chip Selection
The type of application will often determine the device chosen. Needs based on
memory size, speed and special feature availability will determine which device
will be most appropriate. Issues such as cost and availability should also be
investigated.
The tools available will also impact a decision to develop with a certain device.
It is essential to determine if the development decisions you have made are
possible with the device you are considering. For example, if you wish to use C
you must select a device for which there is a C language compiler. It is also
useful to investigate the availability of emulators, simulators and debuggers.
Software Plan
The first step in the software plan is to select an algorithm which solves the
problem specified in your problem specification. Various algorithms should be
considered and compared in terms of code side, speed, difficulty, and ease of
maintenance.
Once a basic algorithm is chosen the overall problem should be broken down
into smaller problems. The home thermostat project quite naturally breaks
down into modules for each device and then each function of that device.
For example, the thermostat may have a display to the LCD display module and
a read from the keyboard module.
53
Programming Fundamentals
Device Plan
The routines for hardware specific features should also be planned. These
routines include:
1) Set up the reset vector
2) Set up the interrupt vectors
3) Watch the stack (hardware or software)
4) Interact with peripherals such as timers, serial ports, and A/D
converters.
5) Work with I/O ports
Code/Debug
The modules from the software plan stage are coded in the project language.
The coded modules are compiled or assembled and all syntactic error are
repaired.
Debugging should consider issues such as:
"# Syntactic correctness of the code
"# Timing of the program
Test
Each module should be tested to ensure that it is functioning properly. This
testing is done using simulators and/or emulators. It is also important to test
the hardware you will be using. This is easily done by writing small programs
which test the devices.
Integrate
The modules must be combined to create a functioning program. At this point
is important to test routines which are designed to respond to specific
conditions. These routines include interrupt service and watchdog support
routines. The entire program should now be thoroughly tested.
54
5. First Look at a C Program
Traditionally, the first program a developer writes in the C language is one
which displays the message Hello World! on the computer screen. This is
a sensible beginning for traditional C platforms where conventional input and
output are important and fundamental concepts.
In the world of 8 bit microcontrollers device input and output play radically
different roles. Programs rarely have access to keyboard input or screen output
devices which are common in traditional C programming 3.
The following introductory program is representative of microcontroller
programming. The program tests to see if a button attached to a controller port
has been pushed. If the button has been pushed, the program turns on an LED
attached to the port, waits, and then turns it back off.
#include <hc705c8.h>
// #pragma portrw PortA @ 0x0A; is declared in header
// #pragma portw PortADir @ 0x8A; is declared in header
#define INPUT 1
#define OUTPUT 0
#define ON 1
#define OFF 0
#define PUSHED 1
void main(void){
PortADir.0 = OUTPUT; //set pin 0 for output (light)
PortADir.1 = INPUT; //set pin 1 for input (button)
while (1){ // loop forever
if (PortA.1 == PUSHED){
wait(1); // is it a valid push?
if (PortA.1 == PUSHED){
PortA.0 = ON; // turn on light
wait(10); // delay (light on)
PortA.0 = OFF; // turn off light
}
}
}
} //end main
Example 6: A typical microcontroller program
3 Most C compilers for 8 bit microcontrollers do not use stdio libraries as these
libraries provide functions for input and output rarely used on 8 bit machines.
55
First Look at a C Program
NOTE
Always comment your code. Even if you are sure no other programmer will ever
look at your code, a near impossibility, you will still need to understand it. You will
often rework code months and even years after it was originally written. Comments
drastically improve code readability.
56
Preprocessor directives
#include <hc705c8.h>
#include is one of the most commonly used preprocessor directives. When
the preprocessor reaches this directive it looks for the file named in the
brackets. In the example above the preprocessor searches for the file
hc705c8.h which contains device specific specifications for the Motorola
68HC705C8.
If the file is found the preprocessor will replace the #include directive with
the entire contents of the file. If the file is not found the preprocessor will halt
and give an error.
In the example the #include directive is used to include the contents of a
header file. By convention, C language header files have the .h extension.
Header files contain information which is used by several different sections, or
modules, of a C program as they contain preprocessor directives and
predefined values which help to describe the resources and capabilities of a
specific target microcontroller.
#define ON 1
#define OFF 0
#define is another commonly used preprocessor directive which is used to
define symbolic constants. Programs often use a constant number or value
many times. Instead of typing in the actual number or value throughout the
program, you can define a symbol which represents the value. When the
preprocessor reaches a #define directive, it will replace all the occurrences of
the symbol name in your program with the associated constant. Constants are
useful for two specific reasons:
1) Increasing program readability. A symbolic name is more descriptive
than a number. For instance, the name ON is easier to understand than the
value 1. Using symbolic constants enhances the readability of your
programs and makes them easier to test, debug and modify.
2) Increasing program modifiability. Since the symbolic constant value is
defined in a single place, only one change is necessary if you wish to modify
the value: in the #define statement. Without the #define statement it
would be necessary to search through the entire program for every place
the value is used and change each one individually.
In the statements #define ON 1 and #define OFF 0, the symbols ON
and OFF are assigned the values 1 and 0 respectively. Everywhere the
57
First Look at a C Program
preprocessor sees the symbol ON it will replace it with the constant 1; where it
sees OFF it will replace it with the constant 0.
5.3 C Functions
C programs are built from functions. Functions typically accept data as input,
manipulate data and return the resulting value. For example, you could write a
C function that would take two numbers, add them together and return the sum
as the result.
When a computer runs a C program, how does it know where the program
starts? All C programs must have one function called main()which is always
the first function executed.
Notice the notation for main(). You specify a function name by following
the name with parentheses. This is the notation used by the C compiler to
determine when it has encountered a function. As long as the name is not a
recognised C command, called a reserved word, the compiler will assume it is a
function if it is immediately followed by a pair of parentheses. The parentheses
may also surround a list of input arguments for the function.
58
C Functions
void main(void){
//function statements
}
Example 7: Syntax for the main( ) function
Example 7 is the definition for the main() function in Example 6. All the
statements that fall between the two braces, {}, have been omitted for
example purposes.
The first use of the term void, prior to the word main, indicates to the
compiler that this main() function does not return any value at all. The
second use of the term void, between the parentheses, indicates to the
compiler that this main() function is not sent any data in the form of
arguments.
Braces must surround all statements which form the body of a function. Even
functions with no statements in their body require the braces – the braces after
a function header indicate to the compiler that you are providing the definition
of a function.
The main() function can execute code from other functions. This is referred
to as calling another function. The calling function must know about the called
function in order to execute its code. A function knows about another function
in two ways:
1) The entire definition of the called function is positioned earlier in the
source file than the calling function.
2) A function prototype of the called function is included before the calling
function in the same source file.
A function prototype describes details of the requirements of a function so
that any program code that calls that function will know what information the
called function requires. The following is a typical function prototype:
void wait(registera);
The example above is a function prototype for a function called wait(). This
function is preceded by the return value void; therefore, it does not return a
value. Unlike main(), the wait() function does expect to receive an
argument, called a parameter. The type of the parameter (registera) is
59
First Look at a C Program
important. It indicates the type of value the parameter will hold – a value of
type registera.
60
The Function Body
pour into the coffee maker depends upon the number of cups you want to
make. At some point in your instructions, you need to allow the person
following them to make a decision about the number of cups needed and,
therefore, the amount of water needed. You might say: “if you want to make 4
cups of coffee, then use 5 cups of water”.
In C decisions are made using control statements. Control statements can select
between two or more paths of execution and repeat a set of statements a given
number of times. Some common control statements are:
while
while(1){
// statements
}
The while() control statement instructs the computer to repeat a set of
instructions (loop) as long as a condition is valid. The condition is an
expression placed in the brackets which follow the while statement. C
considers any condition which does not evaluate to 0 to be true and any
condition which does evaluate to 0 to be false.
In Example 6 the condition is the integer 1b (binary), which is interpreted as
true. Therefore, once the computer begins to execute statements inside the
braces of the while loop, it will not terminate until the computer
malfunctions or is turned off. This kind of loop is often called an infinite loop.
In traditional C programming, an infinite loop is usually avoided. However, it is
often used in embedded systems programming. An embedded controller
typically executes a single program “infinitely”. Only when the controller is
reset or turned off will the loop terminate.
if
if (PortA.1 == PUSHED){
PortA.0 = ON;
}
Example 9: The if statement syntax
while (1){
if (PortA.1 == PUSHED){
PortA.0 = ON;
}
}
Example 10: Nesting if and while statements
When the if decision is placed inside a while loop, the program will test bit
1 in PortA regularly. Assume a button is attached to pin 1 of port A and an
LED to pin 0 of PortA. We have written a small control program which will
continually test the button attached to pin 1. When the button is pushed, bit 1
of PortA will be set. When bit 1 is set and the if statement is executed, bit 0 is
set. The LED attached to PortA pin 0 will be set to 1 and will light up.
A program can delegate a task by calling another function. Once the program
turns on the LED in Example 10 it never turns it off. Remember, the while
loop is an infinite loop. How can we solve this problem?
One solution is to write a function called wait()which creates a delay and
then turn the LED off. Consider the following example code fragment:
while (1)
{
if (PortA.1 = PUSHED)
{
PortA.0 = ON;
wait(10); \\ wait ten seconds
PortA.0 = OFF;
}
}
Example 11: Calling one function from another
When the wait() function is used and the button is pushed, the program
turns the LED on by setting bit 0 of PortA. The wait() function causes a
delay of ten seconds. After the wait function has finished and ten seconds have
passed, the program turns off the LED by clearing bit 0 of PortA.
62
The Embedded Difference
Most embedded systems programs include a header file which describes the
target processor. These header files contain descriptions of reset vectors, ROM
and RAM size and location, register names and locations, port names and
locations, register bit definitions and macro definitions. Most compiler
companies will provide header files for devices supported by their compilers.
Another important aspect of device knowledge is the limits of the device for
which the program is written. For example, a certain device may have very
limited memory resources and great care must be taken in developing programs
which use memory frugally. Along with issues of size comes issues of speed.
Different devices run at different speeds and use different techniques to
synchronise with peripherals. It is essential that you understand device timing
for any embedded systems application.
The previous section mentioned the regular use of the infinite loop
while(1). Embedded developers often use program control statements
which are avoided by other programmers. For example, the goto statement is
used regularly by embedded developers and is often condemned by other
programmers.
Many developers prefer to write some code segments in assembly language for
reasons of code efficiency or while converting a program from assembly
language to C. Most compilers for 8 bit microcontrollers allow the inclusion of
inline assembly, assembly language in a C program.
63
First Look at a C Program
The following two definitions of the wait() function show the function
written in C and the equivalent function in Motorola 68HC705C8 assembly
language.
//C function
void wait(registera delay){
while (--delay);
}
64
6. C Program Structure
The previous section described some typical features of a very simple program.
In this section we will examine in greater detail the building blocks of the C
language.
A C program is built from three components:
1) Directives are directives handled directly by the preprocessor
2) Declarations are instructions for the compiler to record the type
and associated memory locations of symbols
3) Statements are the executable instructions in a program
Declarations define the name and type of identifiers used in your program. One
benefit of programming in a high level language is the ability to construct
generic groups of instructions, called functions, to perform tasks whose steps
are not dependant upon specific values. For example, you can write
instructions to add together two numbers without knowing the values of the
numbers. How can this be done? Through the use of identifiers.
An identifier can either represent a value, called a variable, or a group of
instructions, called a function. C identifiers represent addresses in computer
65
C Program Structure
The compiler allocates memory for all identifiers. As the compiler reads a
program, it records all identifier names in a symbol table. The compiler uses the
symbol table internally as a reference to keep track of the identifiers: their name,
type and the location in memory which they represent.
When the compiler finishes translating a program into machine language, it will
have replaced all the identifier names used in the program with instructions that
refer to the memory addresses associated with these identifiers.
66
Identifier Declaration
Notice that even though the two identifiers are different words, the first five
significant characters are identical.
Identifiers which represent constant data values are allocated from computer
program memory space. Identifiers which represent constant data values do not
require alterable memory: once the value of a constant has been written in
67
C Program Structure
memory it need never change. Therefore, the compiler will allocate a block of
its program memory space, usually in ROM, for each of these identifiers4.
To declare a constant data value, use a declaration such as:
const int maximumTemperature = 30;
Function identifiers are not altered during program execution. Once the value
of a function has been written in the computer’s memory it need never change.
When a function is defined, the compiler places the program instructions
associated with the function into ROM. What happens to the local variables
used in a function’s body of statements? The compiler will write in the data
memory addresses where local variable values will be stored in RAM when the
program runs.
6.3 Statements
The compiler will generate an instruction to store the value 20 in the RAM
memory location set aside for the currentTemperature variable.
4This is not always the case. However, you can safely assume that all C constant
values are stored in machine program memory space.
68
Statements
Forgetting the semicolon at the end of the first line forces the compiler to read
both lines as one statement instead of two. According to the compiler you have
written the following instruction:
currentTemperature = 20 currentTemperature = 25;
Notice that the C compiler does not care about white space between tokens as
it reads through your program. White space includes space, tab and end-of-line
characters. On some computers the end of line will be a single linefeed
character, while on others it will be a linefeed and carriage return together. C
compilers ignore both carriage returns and linefeeds.
When you write a C function you must include function statements as part of
the function definition. Statements belonging to a function are indicated by
surrounding them with braces which immediately follow the function header.
For example, Example 6 in the previous section has braces surrounding all the
statements in the main() function.
You may create statement blocks at other times in your program. For example,
notice the braces after the while and if statements:
while (1){ // this brace begins the block for while
if (PortA.1 = 1){ // this brace begins the if block
PortA.0 = 1;
wait(10);
PortA.0 = 0;
} // this brace closes the block for if
69
C Program Structure
70
7. Basic Data Types
It is easy to see how the computer stores binary values in memory as that is the
manner in which its memory is structured. We have also seen how the
computer stores other types of numbers, such as hexadecimal and decimal, by
converting them to binary form. This section examines how other types of data
can be used.
A computer can store a number in its memory. What about a character? People
use alphabets to encode linguistic information while computers must use binary
notation. To resolve this problem, computer programmers have settled on
encoding schemes for representing characters with numbers such as ASCII
(American Standard Code for Information Interchange) encoding. In the ASCII
character set each character is associated with an integer value. When the
computer needs to store a character it uses the ASCII integer value associated
with that character and stores the number in binary notation.
Data types act as filters between your program and computer memory. Data
types in C provide rules for the storage and retrieval of information from
computer memory. C data types also provide a set of rules for acceptable data
manipulation.
The primary distinguishing characteristic of a data type is its size. The size of a
data type indicates the amount of memory the computer must reserve for a
value of that type. For example, on 8 bit microcontrollers the int data type
(used for storing integer values) is a single byte in size6 while a long or long
int data type is two bytes in size. When the compiler translates a program it
must write the instructions to account for this size difference. The computer
6 The size of the int data type in C is the same as the amount of information the
computer can process. Since 8 bit microcontrollers work with a byte at a time the size
for int is 1 byte (or 8 bits). On most modern computers int varies from 24 bits to 64
bits.
71
Basic Data Types
will know to set aside a single byte for all the int values in your program and
two bytes for all the long values.
A declaration can also be used to ensure that a variable will be assigned a certain
value when it is allocated. When this is done the compiler allocates the
appropriate space for the variable and immediately assigns a value to that
memory location, for example: int currentTemperature = 20;
allocates 1 byte for the variable currentTemperature and assigns it the
value 20. This is called initializing the variable.
Initialization ensures that a variable contains a known value when the computer
executes the first statement which uses that variable. If variables are not
initialized in their declarations, their values are unknown until they are
initialized.
When the compiler comes across a variable declaration it checks that the
variable has not previously been declared and then allocates an appropriately
72
Basic Data Types
sized block of RAM. For example, an int variable will require a single word (8
bits) of RAM or data memory7.
When the compiler allocates memory for a variable it decides where to place the
variable value based on the existing entries in its symbol table. Since the
compiler cannot know what value lies at the address allocated for a particular
variable at compile-time, you can not depend upon a specific value for a
variable the first time it is used.
Compile-time is the point at which the compiler translates a program into
machine code. Run-time indicates the point at which the machine code is
executed on the host computer. It is useful to remember that compilers have
little or no knowledge about a machine’s internal state at run-time.
Declarations that initialise variables are very useful – they ensure that you can
predict what a variable memory location will contain at run-time. When the
compiler reads a declaration which also initializes a variable it first allocates an
appropriate block of memory, then immediately loads the appropriate value into
that location.
Please note that variable declarations which contain an initialization will
automatically generate machine code to place a value at the address allocated for
the variable. Normal variable declarations do not generate any code because the
machine code contains the address allocated for such a variable. This is not the
case for either global variables or static local variables – if they are not initialized
in their declaration the compiler will initialize them by setting their initial values
to 0. The compiler will produce machine instructions to load the 0 value into
the appropriate addresses.
7The amount of memory required for an integer variable varies from computer to
computer. 8 bit microcontrollers have a natural integer size of 8 bits.
73
Basic Data Types
If you declare a variable outside all statement blocks, the scope of the variable
reaches from its declaration point to the end of the source file. Variables
declared in this manner are called global variables because they can be used by
any program code which comes after them in the same source file.
A variable declared outside a statement block can be accessed by any statement
in your program by declaring the variable in a certain way. In order for a
statement block or separate program file to access to such a variable, it must be
declared as an external symbol. This means using the extern storage class
modifier. For example, the following declaration tells the compiler to look for
the original declaration of currentTemp in another file or below in the same
file: extern int currentTemp;.
The use of extern in a variable declaration is similar to the use of a function
prototype – it informs the compiler of a variable’s name and data type so that it
can be used before it is actually defined. As with function prototype
declarations, the compiler does not allocate memory when it sees an extern
variable declaration.
A variable declared inside a statement block has a scope from the declaration to
the end of the statement block. Variables declared inside a statement block are
called local variables, as they are accessible only to statements which follow
them within the same statement block. Typically, programmers will declare
variables whose scope is local to a specific function. The variable name and
value will be defined only within that function and other functions cannot
directly refer to the variable.
What happens if you have two functions which each contain local variables with
the same name? Since a variable is local to its respective functions the compiler
can distinguish between identically named variables. A variable name must be
unique within its scope.
What happens when scopes overlap? The most recently declared instance of a
variable is used. If you declare a global variable called temp outside all
statement blocks and a local variable called temp inside your main()
74
Basic Data Types
function, the compiler gives the local variable precedence inside main().
While the computer executes statements inside main()’s scope (or statement
block), temp will have the value and scope assigned to it as a local variable.
When execution passes outside main()’s scope, temp will have the value and
scope assigned to it as a global variable.
A function data type allocates memory for the type of value the function
returns. Function identifiers work differently than variables. When a function is
defined a data type for the function must be included. Instead of indicating the
amount of memory set aside for the function itself it indicates the amount of
memory the compiler needs to reserve for the value returned by the function.
For example, a function of type int returns a signed integer value and 8 bits
are reserved for the return value.
Suppose we have a function defined as follows:
void wait(int timeInSeconds);
The void keyword indicates to the compiler that the function will not return a
value; therefore, no memory is allocated for a return value.
75
Basic Data Types
Parameter data types indicate the size of memory reserved for function
parameter values. We define a data type for the parameter the function expects
to be passed when it is called. The declaration of timeInSeconds as an
int in the function declaration void wait(int timeInSeconds);
tells the compiler to allocate a single byte to hold the parameter value when the
function is called.
The void keyword can also be used in a function parameter list:
void main(void);
This indicates to the compiler that the function does not expect to receive any
parameter values when called. The compiler does not allocate any memory for
void parameters.
The C language character data type, char, stores character values and is
allocated 1 byte of memory space. Microcontrollers do not often manipulate
alphabetic information, but sometimes it is required. The most common use of
alphabetic information is reading input from a keyboard device, where each key
typed is indicated by a character value. The char type uses a single byte of
memory and stores the value of each character by storing its ASCII code.
When assigning a character value to an identifier you must place the character in
single quotes. The quotes tell the compiler that the value is a character constant
and not the name of another identifier.
char firstLetter;
firstLetter = 'a';
firstLetter = a;
Example 17: Assigning a character value
The first assignment in the example above places the ASCII value for the
character a in the memory location assigned to the firstLetter variable.
When the compiler reads the second assignment statement, it assumes that a is
the name of a second variable. If no variable called a exists the compiler will
generate an error.
76
Basic Data Types
The order in which ASCII arranges its characters is called its collating
sequence. The collating sequence is arranged so that the letters 'A' through 'Z'
are in unbroken, ascending order with the decimal values 65-90, as are the
letters 'a' through 'z' with the decimal values 97-122. In addition, the digits '0'
through '9' are in unbroken, ascending order with the decimal values 60-71. The
collating sequence allows for easy sorting of characters and the use of
characters in simple arithmetic operations.
You can specify any character in the ASCII set with a special escape sequence –
a backslash immediately followed by the octal or hexadecimal ASCII value for
the character8. This supports character values that can not be typed using a
keyboard.
You can represent common special characters using escape sequences. For
example, the escape sequence to produce the carriage return character, the
character produced when someone types the e key, is \r.
Escape sequences are useful for typing a literal character that the C compiler
might interpret in another way. For example, to type a literal single quote
8For historical reasons you can not specify a decimal ASCII value in an escape
sequence. The general format for the escape sequence is as follows:
\### // octal value
\x## // hexadecimal value
77
Basic Data Types
Integer values can be stored as int, short or long data types. The default
size for a number on most microcontrollers is 8 bits (a single byte). Therefore,
the int data type for these computers requires a single byte of storage. Some
compilers offer the ability to switch to using 16 bit integers by default. The size
of int values usually equals the natural data size of the target computer.
C allows you to manipulate both positive and negative integer values and uses
different methods to store each value in memory. Signed integer values have the
left-most bit reserved for a sign bit. The state of the sign bit indicates whether
the stored value should be treated as a positive or negative value.
The existence of a sign bit means that there are only 7 bits left in which to store
the actual value of the integer. An 8 bit signed int value can therefore range
7 7
from -2 to 2 -1. There is one more value available on the negative side of zero
because zero itself counts as a positive value.
On many traditional C platforms, the size of an int is more than 2 bytes. The
short data type helps compensate for varying sizes of int. On platforms
where an int is greater than 2 bytes, a short should be 2 bytes in size.
On platforms where an int is 1 or 2 bytes in size —most microcontrollers—
the short data type will typically occupy a single byte. This can be useful for
embedded system programmers, especially on systems which provide a switch
to “turn on” 16 bit int values. In these cases, you can maintain code
portability by using short for those values that require 8 bits and long for
values which require 16 bits.
Like the int, the short data type uses a sign bit by default and can therefore
contain negative numbers.
78
Basic Data Types
Should your program need to manipulate values larger than an int, you can
use the long data type. On most platforms the long data type reserves twice
as much memory as the int data type. On 8 bit microcontrollers the long
data type typically occupies 16 bits; this allows the representation of signed
15 15
integers ranging from -2 to 2 -1.
It is important to note that long integer values are almost always stored in a
memory block larger than the natural size for the computer. This means that
the compiler must typically generate more machine instructions when a
program uses long values. Programs will usually operate more quickly and
efficiently if they only use 8 bit data types.
Integer data types usually hold values expressed in decimal notation. It is also
possible to express an integer value in other notations. For example, the
following declarations assign the same value to their respective variables
expressed in different notations: All C compilers allow the expression of integer
values in decimal, octal and hexadecimal notation. The ability to express values
in binary notation is an enhancement to the language not available on all
compilers.
int decimalInt = 32;
// all octal values begin with 0
int octalInt = 040;
// all hex values begin with 0x
int hexadecimalInt = 0x20;
// all binary values begin with 0b
int binaryInt = 0b00100000;
Example 18: Octal, hex and binary notation
So far we have seen two general classes of simple data types: the character data
type char and the integer data types int, short and long. By default, the
79
Basic Data Types
char type holds values from 0 to 255 and does not permit any negative values.
However, int data types permits a range of both negative and positive values.
The C language allows you to modify the default behaviour of simple data types
and thereby produce char variables which can hold negative numbers and
integer variables which permit only positive values.
The data type modifiers signed and unsigned allow you to specify
whether you wish a variable to hold negative numbers or not. They instruct the
compiler whether or not to include a sign bit in the allocated memory.
By default, char variables are unsigned and cannot hold negative values.
Also by default, integer variables (int, short and long) are signed and
can hold negative values. Actually, the short and long types are not data
types, they are data type modifiers. As a result, you often see declarations such
as:
short int myShortInt;
long int myLongInt;
Because int is the default data type in C you can simply declare variables as
short and long. Some programmers insist that you should never take this
short cut. However, some compilers actually implement the short and long
as separate data types.
Place modifiers before the data type in a variable declaration. For example:
unsigned int myAge;
While many computers make extensive use of real, or floating point numbers
(numbers with digits on both sides of the decimal place) 8 bit microcontrollers
80
Basic Data Types
do not. The resources needed to store and manipulate floating point numbers
can place overwhelming demands on an 8 bit computer and usually the value
gained is not worth the resources expended. Some C compilers for 8 bit
microcontrollers offer limited support for floating point data types, but most
do not.
The fundamental data type for representing real numbers in C is the float
type. Those compilers that do offer this data type store real numbers as
floating point values – a special way of representing real numbers in computer
memory. The maximum value for the target computer is defined in a C header
file called values.h as a symbolic constant called MAXFLOAT.
C compilers generally allocate 4 bytes for a float variable – you can see why 8
bit microcontrollers might have difficulty handling such values– which provides
approximately 6 digits of precision to the right of the decimal. You can have
greater precision with the double and long double data types. Compilers
typically allocate 8 bytes for a double variable and more for a long
double. There are approximately 15 digits of precision with double values
and perhaps more from long double values.
You can assign an integer value to a floating point data type but you must
include a decimal and a 0 to the right of the decimal.
myFloatVariable = 2.0
81
8. Operators and Expressions
The chief purpose of programming is providing the computer with a set of
generalized instructions for solving problems. This concept is so important to
programming that programmers use a specific name for a set of generalized
instructions – the term algorithm. In fact, many programmers insist that
programming consists of two simple steps:
1) Choosing suitable data structures to contain and organize program
data
2) Choosing the appropriate algorithm to manipulate that data.
Once you have determined variable and function data types it is time to
examine how the functions will manipulate the data.
8.1 Operators
Variables and functions contain and pass values among program modules.
Operators allow you to perform calculations with these values. C has more
operators than most other programming languages.
When you write a program in any language a significant portion of the program
is dedicated to doing simple data manipulation such as incrementing or
decrementing counters and multiplying or dividing a variable by a number. In
most languages these simple manipulations require a statement of some length
or more than one statement. C encapsulates many of the most common simple
data manipulations in its operator set.
For example, consider incrementing a counter. In most programming languages
the following statement is required to increment a counter.
counter = counter + 1;
The original purpose of the increment operator was to create faster and more
efficient code. Most computers have a low level hardware instruction which
performs a simple increment upon a value. This instruction uses less resources
83
Operators and Expressions
than the instructions required to add two numbers together and assign the
result to a third which is the case in the first example.
Modern compilers are quite sophisticated, especially in the optimization of code
during translation to machine language. Most compilers will see the first
example written in a program and translate it into the speedier machine-level
increment. Programmers who care about readability and clarity will insist upon
using the syntax of the first example and allow the compiler to generate the
faster and more efficient code.
Recent criticisms of C describe the operator set as overly large. It is true that
badly written C code tends to rely on the effects and side effects of operators,
making it very difficult to read and debug. Often these problems are a function
of bad programming style.
8.2 C Expressions
All C expressions have values9 which your program uses when the expressions
are evaluated. In the previous example, the value of the single expression in the
statement is 5. Operators work by acting upon the expression values. For
example, the + operator takes the value from one expression and adds it to the
value of another expression:
2 + 3;
9This is not always the case. A call to a void function has, by definition, no value.
However in practical terms an expression always evaluates to some value.
84
Operators and Expressions
The combination of two expressions (2 and 3) with the addition operator forms
a single, larger expression. When the computer evaluates the entire expression,
its value consists of the sum of the two smaller expression values joined by the
addition operator, the value 5.
8.2.1 Binding
How does the compiler determine which expressions apply to each operator in
a program’s statements? The rules which govern operator behaviour specify the
number of expressions the operator requires – we indicate this relationship by
saying that an operator binds to a number of expressions. For example, an
operator that manipulates the value of a single expression binds to a single
expression.
Operators that bind to a single expression are called unary operators. Some unary
operators bind to the expression to their immediate right – these are called
prefix unary operators where the operator act as a prefix to the bound
expressions. Other unary operators bind to the expression to their immediate
left. These are called postfix unary operators. For example:
a[6]; //postfix unary operator
a++; //postfix unary operator
++a; //prefix unary operator
&a; //prefix unary operator
Example 20: Postfix and prefix unary operators
Operators that bind to two expressions are called binary operators. Binary
operators bind the expressions located to their immediate left and right. For
example, the addition operator used in our previous example is a binary
operator and uses the general form a+b.
a * b; //multiply two expressions
a / b; //divide two expressions
a - b; //subtract one expression from another
a + b; //add two expressions
a >> b; //shift bits right
Example 21: Sample binary operators
85
Operators and Expressions
The compiler evaluates expression a, if it is true (non-zero) then the value of the
entire expression is the value of expression b. If expression a is false (zero),
then the value of the entire expression is the value of expression c.
Statements often contain more than one operator. For example, consider the
conversion from degrees Celsius to degrees Fahrenheit. The equation for this
operation is:
F $ % (C$&2) ' 2 ( 30
C$ % ( F $&30) ) 2 ( 2
The equivalent expressions in C can be written as:
Fahrenheit = Celsius - 2 * 2 + 30;
Celsius = Fahrenheit - 30 / 2 + 2;
Example 23: Combining operators in a statement
86
Operators and Expressions
Consider Example 23: the first rule tells us that multiplication and division are
done before addition and subtraction. This means that the expression 2 * 2
will be evaluated first. The result of this expression will then bind with the -
operator, along with + 30. Putting brackets around the first part of the
calculation explicitly demonstrates the desired order of operations:
Fahrenheit = (Celsius - 2) * 2 + 30;
The revised statement implements the second rule: the brackets leave the
compiler with no doubt about which part of the calculation to perform first.
More importantly, the brackets explicitly depict the programmer’s intentions.
The most common use for the comma operator is inside the initialization or
condition expression of for or while loops.
C allows statements with more than one assignment operation. For example,
you can initialise a number of counter variables with a single statement in C.
The parentheses in the second line show how each operator in the statement
naturally binds.
counterOne = counterTwo = counterThree = 1;
(counterOne = (counterTwo = (counterThree = 1)));
counterOne = (counterTwo = counterThree) = 1;
Example 25: Combining assignment operators in statements
87
Operators and Expressions
The division operator, /, returns the whole quotient. Any fractional portion of
the division is truncated and lost. Remember that truncation is not the same as
rounding. For example, the expression 5/2 returns the value 2, not 2.5 or 3.
Truncation only occurs during integer division. If floating point numbers are
involved in the operation, then the division operator will perform a floating
point divide.
The modulus operator, %, returns the remainder of a division operation. For
example, the expression 5%2 returns the value 1. Thus, if you have calculated a
total number of months, you can easily convert to the number of years and
number of extra months using the following expressions:
years = totalMonths/12;
extraMonths = totalMonths%12;
Example 28: Differentiating the division and modulus operators
88
Operators and Expressions
The increment and decrement operators are unary operators with higher
precedence than the arithmetic operators. The increment operator, ++, adds
one to its binding identifier, while the decrement operator, --, subtracts one.
You can use the increment and decrement operators in two ways: prefix and
postfix. All of the following expressions are valid.
++counter; //prefix increment
counter++; //postfix increment
--counter; //prefix decrement
counter--; //postfix decrement
Example 29: Prefix and postfix notation for increment and decrement
Because the increment and decrement operators modify the value of the
identifier they bind to, they can not be bound to complex expressions. The
following statement is not valid: ++(a + b);.
It is essential to understand how various forms of the increment and decrement
operators return values.
The first line of code assigns a value of 0 to the variable counter. The second
line assigns the value of an increment expression to j. Because we used the
postfix increment operator, the expression returns a value of 0 which is the
current value of the counter variable. counter is then incremented by 1.
The postfix decrement operator in the fifth line forces the expression to return
the current value of counter and then decrements counter by one. This
89
Operators and Expressions
With the prefix notation, the second line of code sets j to 1 instead of 0. The
increment operation is performed first, then j is assigned the new value of
counter.
NOTE
If you are using increment and/or decrement operators in a complex expression you
should carefully document their use. Side effects caused by increment and
decrement operators can make reading and debugging code extremely difficult.
The basic assignment operator, =, assigns the value of its right hand expression
to the identifier on its left hand side. C also provides specialised assignment
operators.
Many programs include statements such as total = total +
subTotal;. Programmers constantly perform operations upon a variable’s
value and then reassign a new value to that variable.
This simple type of calculation is so prevalent in programming that the authors
of C decided to provide a class of operators to act as short cuts. You can
combine any arithmetic or bitwise operator with an assignment operator.
90
Operators and Expressions
The assignment operator in the first line of Example 32 takes the value of its
right hand expression, the value of subTotal, and increments its left hand
identifier by that amount. The expression in the second line multiplies the right
hand expression by the left hand identifier’s value, and then reassigns the new
result to the left hand side.
NOTE
Short cut assignments can be obscure and difficult to follow for anyone else reading
your program code. Make sure to insert comments to explain the use of the
statement.
Most programs depend on the ability to compare values. Are two values equal?
Does a variable have a positive value? Are two expressions true? All these
questions are typically posed in computer programs. C provides three sets of
operators you can use to test and return the truth value of an expression:
equality operators, relational operators and logical operators.
91
Operators and Expressions
The == operator returns 1 if its two binding expressions are identical in value.
In the example: (PortA.1 == 1) assume that PortA.1 represents the
value of bit one in the port defined as PortA. The expression returns 1 if
PortA.1 has the value 1 and 0 if it does not.
NOTE
Do not confuse the == equality operator with the = assignment operator!
PortA.1==1 tests bit 1 of Port A to see if it is set while PortA.1=1 sets bit 1 to 1.
The equality operator is often used as part of a statement which controls the
execution of a loop or a conditional action.
while (PortA.1 == 1) {
// statements
}
if (counter == 10){
// statements
}
Example 35: Using the equality operator in control structures
The != operator returns 1 if its binding expressions are not identical in value. In
the following example, the expression returns 1 if bit 0 of PortA is not
cleared.
while (PortA.0 != 0){
// statements
}
Example 36: The inequality operator
Relational operators return 1 when they correctly express the relative values of
their binding expressions.
92
Operators and Expressions
The less-than operator, <, returns 1 if the left hand side expression’s value is
less than the right hand side expression’s value. The expression (2<3) returns
TRUE while the expression (3<2) returns 0.
The greater-than operator, >, returns 1 if the left hand side expression is greater
than the right hand side expression. Therefore, the expression (2>3) returns a
value of 0 while the expression (3>2) returns a value of 1.
Both the less-than and greater-than operators have an “or equal to” version.
Both the less-than-or-equal, <=, and greater-than-or-equal, >=, operators
return 1 if their left hand sides are equal to their right hand side. For example,
(3<=3) returns 1 while (3<3) returns 0.
The unary logical NOT operator, !, returns 1 if its binding expression’s value is
0; otherwise, it returns 0.
The binary logical AND operator, &&, returns 1 if both of its binding
expressions return non-zero values; otherwise, it returns 0. Consider the
following example:
#define size 30
int i=0;
char s[size];
void main() {
for (i=0;(s[i]!=0)&&(i<size);i++)
putc(s[i]);
}
Example 37: Logical NOT and AND operators
93
Operators and Expressions
While the logical expression appears complex, it is actually quite simple. In fact,
a good compiler will flag the entire if structure as unreachable or dead code.
Why? Because of C’s short-circuit evaluation ability. As soon as the computer
begins evaluating the logical expression, it determines that PortA.0 has a 0
value. It knows that this will make the outermost logical AND expression false;
therefore, it does not evaluate the any more of the expression.
Careful design can exploit short-circuit evaluation. For example, you might
want to avoid calling the function to read a key from the keyboard buffer if no
key has been pressed. The following construct shows how you can use short-
circuiting of logical expressions to achieve this:
if ( (keyPressed() == TRUE) && ((myKey = getch()) == 0) ) {
// special key has been pressed
specialKey = TRUE;
myKey = getch();
}
Example 40: Using short-circuit evaluation
If the keyPressed() function returns 0, a key has not been pressed and the
logical expression will short circuit and avoid the call to getch() – the
compiler knows that if any term in a logical AND expression is false the entire
expression is false.
94
Operators and Expressions
Notice that the second term in the logical AND expression serves two
purposes. When keyPressed()returns 1 a key has been pressed, the second
term gets a value for myKey and decides whether the pressed key was a special
key (the special character NUL) or not. If getch() returns 0 then we make
another call to getch() to retrieve the identity of the special key.
Bit level or bitwise operators are operators which evaluate and manipulate data
at the bit level. These operators are especially useful to embedded system
programmers. They fall into two main classes: logical operators and shift
operators.
C supports one unary and three binary bitwise logical operators. Each of these
operators act only upon values stored in the char, short int, int and
long int data types.
NOTE
Binary logical operators perform data promotion on operands to ensure both are of
equivalent size. If you specify one short operand and one long operand, the
compiler will widen the short to occupy the long 16 bits. This expression will
return its value as a 16 bit integer.
The AND operation is easier to see if your compiler has an extension which
permits data values in binary:
95
Operators and Expressions
int x=0b00000101,
y=0b00000111,
z;
z = x & y; // z gets the value 00000101, or 5
Example 42: Using the AND bitwise operator with binary values
The resulting value for z has a bit set in every position where both x and y
have a bit set, and bits cleared in every other position.
NOTE
The bitwise AND, &, is not the same operation as the logical AND, &&.
The value for z has a bit set in every position where either x or y have a bit set,
and bits unset in every other position. This produces a result with all the bits
that either operand has set.
NOTE
The bitwise OR, |, is not the same operation as the logical OR, ||.
96
Operators and Expressions
Shift Right
The right shift operator shifts the data right by the specified number of
positions. Bits shifted out the right side disappear. With unsigned integer values
0s are shifted in at the high end as necessary. For signed types the values shifted
in is machine dependant. The binary number is shifted right by number bits:
x >> number;. Right shifting a binary number by n places is the same as an
n
integer division by 2 .
porta = 0b10000000;
while (porta.7 != 1){
porta >> 1;
}
while (porta.0 != 1){
porta << 1;
}
Example 46: Shifting bits left and right
Shift Left
The left shift operator shifts the data right by the specified number of
positions. Bits shifted out the left side disappear and new bits coming in are
zeroes. The binary number is shifted left by number bits: x << number;.
n
Left shifting a binary number is equivalent to multiplying it by 2 .
97
9. Control Structures
One of the most important features of any programming language is the ability
to control the way in which program statements are executed. Normally, a
computer executes all the statements in your program sequentially. It will start
at the first statement in the main() function and execute each statement and
function call until it finishes executing the last statement in main().
Sometimes you want the computer to deviate from sequential execution.
Control structures allow the making of decisions about which instructions to
execute. You can also use control structures to repeat a set of instructions.
The C language contains a variety of powerful and flexible11 control structures.
In general, control structures fall into two groups – those that branch and those
that loop.
11 One problem stems from the overburdening of control structures with statements
99
Control Structures
testVariable = 1;
while (testVariable) {
// some statements
}
Example 47: Controlling loops without using logical operators
As long as the testVariable retains the value of 1 the loop will continue.
At some point a statement inside the loop might set the variable’s value to 0,
causing the loop to terminate before the next cycle.
C provides two structures the programmer can use to support different types of
decisions. Decision structures test an expression to determine which statement
or statement block to execute.
if
The if structure specifies a specific execution path based on the value of a
particular expression. The following example shows the general form for this
structure:
if (expression) {
// if expression is true do these statements
}
Example 48: if and else structure
100
Control Structures
else
There is an additional optional component of the if structure which executes a
set of statements when the if tested expression is false. This additional
component is the else structure.
The else structure must be the first statement following an if structure.
When the if condition evaluates to 0, execution will pass directly to the else
structure.
if (expression) {
// if expression is true execute these statements
}
else {
// if expression is false execute these statements
}
Example 50: The else statement
Like the if, else needs a complementing statement or statement block which
provides its semicolon terminator. Unlike the if, else does not test the value
of an expression.
If the value of PortA_DDR bit 0 it not 1 and the value of PortA_DDR bit 1
is not 1 then PortA bit 0 and PortA bit 1 are set to 1. You could encapsulate
the example into a single logical expression such as:
101
Control Structures
An else always matches with the nearest unmatched if. A common problem
with if..else structures arises from a set of statements such as:
if (a)
b=1;
if (!a)
b=2;
else
b=3;
Example 53: Matching if and else statements
If a has the value 1, what value will b have after these statements? What if a
has the value of 0? Will b ever have the value of 1 after these statements? The
answer to all these questions depends entirely upon a syntactic question – which
if statement does the else belong to? In C an else structure always
belongs to the nearest if not already associated with an else.
Using this rule, we can see that the else associates with the second if, not
the first. Therefore, if a has the value 1, b is first given the value 1 by the
statement if (a) b=1; . When the next if statement is evaluated b is given
the value 3 because the conditional statement is !a which evaluates to !1 or 0.
The else statement is executed and b is assigned the value 3.
Good programmers include braces around the complementing statements of
if and else structures in order to make code easier to read, debug and
modify. Applying this principal to the previous example makes the situation
much more obvious.
if (a) {
b= 1;
if (!a) {
b=2;
}
else {
b=3;
}
}
Example 54: Using braces to clarify the combination of if and else
102
Control Structures
if (!a) {
b=2;
} else {
b=3;
}
Example 55: An alternate format for showing if else pairing
if..else structures let you make a decision between two paths based on the
truth value of a single expression. You can use a series of nested if statements
to test a variable for a series of possible values, but C includes a statement
which tests many possible variable values: the switch-case structure. This
structure lets you switch between several different possible paths of code to
execute.
The switch-case structure has a switch value or expression upon which
the branching of code execution is based. Statement execution depends upon
the different cases provided for possible values of the switch. The general
format for the switch-case structure looks like:
switch (expression) {
case possibleValue :
statement;
statement;
break;
case anotherPossibleValue :
statement;
break;
}
Example 56: The switch..case structure
Each value for the switch expression is preceded by the keyword case, and
followed by a colon. When the switch is executed each case is tested in
turn. If a case value does not match the evaluated value of the switch
expression, all code is ignored until the next case statement is encountered or
the end of the switch block is reached. If a case value matches the
103
Control Structures
switch value, execution begins with the statement following the matching
case.
Once a case value matches the switch expression every subsequent line is
executed, including those after subsequent case statements. Most of the time
this is not the desired action; you want the computer to execute the code for
only one case. To avoid the “fall through” effect of the C switch-case
structure, you must place a break statement at the end of each case.
Sometimes you may wish to take advantage of the “fall through effect”.
Consider the following simple example which enables a decimal point if specific
digits are being displayed:
switch (digit) {
case 1:
addpt = 0;
case 2:
case 4:
addpt = 0x80;
break;
case 5:
addpr = 0;
break;
}
Example 57: Using the fall-through effect with switch statements
Notice that if the second or fourth digit is being displayed addpt is set to
0x80. This variable addpt is a flag which allows the display of a decimal
point which delineates between minutes, seconds and fractions of a second.
The “fall-through” effect is used with case 2: where it falls through to case
4:. If the first or fifth digit is being displayed addpt is set to 0 indicating that
no decimal point is displayed.
NOTE
Notice that there is a break statement after the 5 case value, even though this is
not compulsory. It is good programming practice which helps in the event that you
modify the structure by adding additional cases. The existing break can help
prevent debugging problems.
104
Control Structures
switch (digit) {
case 1, 5:
addpt=0;
break;
case 2, 4:
addpt = 0x80;
break;
}
Example 58: Multiple case enhancement
Many C programmers have strong objections to the use of the goto statement.
The goto remains a holdover from early programming languages without
sophisticated control flow. Because C provides a variety of useful control
structures, you should not need to use goto statements.
If you do use a goto statement, be extremely careful and document it well.
Consider the following example:
void main(void){
if (time < limit)
time++;
105
Control Structures
else
goto Done;
Done:
}
Example 60: The goto statement
NOTE
Make sure that when you use a goto statement you document where the target is.
This can help prevent later debugging problems. As a general rule, you should write
code which uses some control flow method other than a goto.
You may have noticed a similarity between the goto statement and the
switch..case statement. This similarity in form comes with a similarity in
function. The switch-case behaves like a goto or jump table where each
case is a label.
NOTE
It is essential to remember that the switch-case operates like a jump table. The
fall-through effect of case statements can be useful, but a source of debugging
problems if break statements are not used properly.
The key component of any loop structure is the control expression. At some
point in each iteration, the control expression is tested. If the control
expression evaluates to 0 program execution passes to the first statement
following the loop structure. If the expression evaluates to 1, execution
continues within the loop structure statement block.
106
Control Structures
NOTE
The only control you have over loop structures is the control expression. The
compiler cannot tell you if a loop has a control expression which will never evaluate
to 0. In embedded systems programming, infinite loops are often used to keep the
program running constantly.
The simplest C loop structure is the while loop. Here is the general form of
the while loop:
while (controlExpression) {
// statement block
}
Example 61: The while loop syntax
In a while loop the control expression is at the top of the structure. The
while loop evaluates the control expression before every loop iteration –
including the first loop iteration. Therefore, if a control expression evaluates to
0 the first time the while loop is encountered, the statements inside the
structure will never execute.
The do loop tests the control expression value after every loop iteration. The
general form of a do loop is as follows:
do {
// statement block
} while (controlExpression); // close do
Example 62: The do loop syntax
Because the do loop tests the control expression after every iteration of the loop
the statement block will always execute at least once, even if the control
expression evaluates to 0 when the loop is first entered.
NOTE
The do loop is one of the few cases where keywords belong at the end of a
statement block. Because of this, you should place the while expression on the
same line as the loop’s closing brace and put a comment after the while explaining
that it closes a do structure.
107
Control Structures
The most complex and flexible looping structure available in C is the for loop.
The for loop incorporates statements which alter variables used in the control
expression. The example on the left shows a while loop and that on the right
shows the equivalent for loop:
while loop for loop
counter=0;
while (counter<=10){ for
//statements (counter=0;counter<=10;counter++){
counter ++; //statements
} }
Example 63: Comparing the while and for loops
Each for loop has up to three expressions which determine its operation. The
following example shows general for loop syntax. Notice that the three
expressions in the for loop argument parentheses are separated with
semicolons.
for ( initialize; control; increment) {
// statement block
}
Example 64: Using the for loop
The first expression, initialize;, provides initial values for variables used
in the control expression. When the for loop is first encountered this
initialization expression is executed.
The second expression, control;, is the same as the control expression used
in the while and do loops. Like the while loop, the for loop control
expression is checked before each loop iteration. If the control expression
evaluates to 1, the loop statement block is executed; otherwise, execution passes
to the first statement following the loop. In Example 63, the control
expression tests to see if counter is less than or equal to 10. As long as the
expression returns 1, the loop will iterate.
108
Control Structures
NOTE
You can omit any of the for loop expressions, but you must include the semicolon
separators so the compiler knows which expressions have been left out. If the
control expression is omitted the for loop will not stop.
Use a break statement to completely break out of a loop. The most common
place for a break statement is inside a switch-case structure. However,
this is not the only place it can be used. You can also use a break statement to
break out of any looping structure in C. When a break is encountered
inside a looping structure, the loop terminates immediately and execution
passes to the statement following the loop.
You may wish to jump to the next iteration of a loop without breaking out of
the loop entirely. A continue statement will allow you to do this. When a
continue statement is encountered inside a looping structure, execution
passes immediately to the end of the loop statement block. Because execution
passes to the end of the loop statement block, the next action is the evaluation
of the loop control expression.
If continue is used with a while or for loop, execution jumps from the
end of the statement block to the control expression at the top of the loop. If
used with a do loop, execution passes from the end of the statement block to
the control expression at the bottom of the loop. In all cases, the effect is the
109
Control Structures
110
10. Functions
Functions are the basic building blocks for all C programs.
There are some restrictions for the creation of C functions. Each function in a
C program must be self-contained. You may not define a function within
another function. Also, you may not extend the definition for a function across
more than one file – when you define a function it must be contained within a
single file.
10.1 main()
Every C program has at least one function called main(). When the target
computer runs your program, program execution generally begins with the first
statement of the main() function.
In reality, program execution usually begins with initialization code quietly
linked into the program. The C compiler may generate this automatically, based
on information contained within the C program, or it may link in a standard
library. But the compiler cannot know the entire state of a target embedded
system before invoking main(). In a desktop system, the OS itself covers
most of the hardware details. In an embedded system without an OS, you may
be obliged to write intialization code to establish the running state of the MCU
before transferring control to main().
There it little to stop you from performing such initialization within main()
itself.
Any function in a C program can execute, or call, any other function. Typically,
the main() function calls one or more other functions which may in turn call
other functions. There is a restriction on the calling of functions: a function
cannot call a function which it does not recognize. There are two different
techniques for allowing a function to be recognised.
111
Functions
! Provide the full definition for a function before the part of the program which
calls it. This method has the following complications:
1) C lets you combine functions from several files into a single program –
how would you alert your program to functions found in another file?
2) It is possible to have two functions call each other – which of these
functions would you define first?
" Use a function prototype to alert the compiler about a function before you actually
provide its definition. This method has several advantages which are explained
in the following section.
The syntax for a function call in C is the function name and a list of parameters
surrounded by parentheses. When the C compiler encounters an identifier
followed by a left parenthesis it knows that the identifier represents a function.
For this reason, function names always include a pair of parentheses, for
example main().
If you have defined a function called sum() to add two integers and return the
result you can assign the return value of sum() to a variable with the following
line:
sumResult = sum(firstNum, secondNum);
Notice that the function call to sum() fits into an assignment expression in
the same way as a variable or variable expression. You can place a call to a
function any place an expression can occur. For example:
areaRectangle = height * sum(length, width);
You can include expressions in the parameter list in place of variable names as
long as the expressions evaluate to an appropriate data type. For example, a
formula to calculate the hypotenuse of a right-angled triangle could look like
this:
hypotenuse = sum((sideOne*sideOne), (sideTwo*sideTwo));
Notice that there are parentheses around each expression in the parameter list.
This is for the sake of clarity but the is not required because of the extremely
low precedence of the comma operator which separates elements in the
parameter list.
112
Functions
Function prototypes let you take advantage of functions in other files, even if
the files have already been compiled. Pre-compiled files of functions are called
object libraries and most C development environments make extensive use of
them. The prototypes for functions in a pre-compiled library are often
contained in a header file. This allows you to take advantage of pre-compiled
functions without having to worry about compiling or maintaining them.
For example, traditional C development environments provide standard library
functions to handle user input and output. A header file called stdio.h
contains the prototypes for these functions. Since very few 8 bit platforms
provide resources for user input and output, these library functions are not
typically needed. 8 bit microcontroller libraries are often for such things as
A/D, serial and peripheral support. For example, the library lcd8.h contains
functions which write data to the LCD, control the LCD and initialize the
LCD. If you use any functions from the LCD library, you must include the
appropriate header file in your program:
113
Functions
#include <lcd8.h>
The first element in the function prototype is a data type. This tells the
compiler the data type of the function’s return value. The type of the return
value informs the compiler how much memory to allocate in RAM to hold that
value. It also ensures that you use the function properly in expressions
elsewhere in the program. In this case, you can put a call to sum() in any
expression where an int value could occur.
After the function data type comes the name of the function. This identifier is
entered in the symbol table and associated with an address which contains the
beginning of the function’s executable code. When your program calls a
function, execution jumps to the address associated with the function name.
Parentheses following an identifier inform the compiler that you are declaring a
function, not a variable. You must include the parentheses, even if a function
accepts no parameters. If there are function parameters each one should include
a data type and a meaningful name. It is only necessary to include the data type
of each parameter in a function prototype declaration. For example:
int sum(int, int);
However, this form is unclear. Including meaningful names for each parameter
increases program readability. It also helps to understand the order in which a
function reads parameters. For example, suppose you encounter the function
prototype:
int portControl(int, int);
114
Functions
the second parameter the data direction values. A prototype like the following is
much clearer:
int portControl(int portLoc, int DDR);
Function prototype names need not be those used in the function definition,
but using the same names helps to avoid confusion.
When most compilers encounter such a declaration, they assume that the
function will return an int value. In embedded systems this practice wastes
memory resources because the space for the int is reserved.. To avoid this
problem use the void keyword:
void wait();
The void keyword tells the compiler explicitly that the function will not return
a value so no memory is allocated for a return value. You can also use void
inside the parentheses of a function prototype to explicitly declare that the
function accepts no parameters:
void wait(void);
It is best to include the void keyword whenever you have a function without a
return value or parameters. This clarifies the purpose of your functions.
12 Some C programmers insist that functions which just produce side effects should
115
Functions
116
Functions
Most functions required information from the code that calls them. The most
common way to pass information to a function is through its list of parameters.
You can also pass information to a function using of global variables – any
variable in global program space can be used by any program function. It is
good programming practice to avoid the use of global variables if possible.
When you call a function, parameter values are passed to the function. The
compiler will set aside the appropriate amount of memory to hold these values.
This is why it is important to specify function parameter data types in the
function prototype. The following code clarifies this.
void change(int num) {
num = 4;
}
void main(void) {
int val = 2;
change(val); //send value of val to change()
val += val; // val = 2 + 2 = 4
}
Example 68: Passing data to a function by value
What value will val have after last line in main()? The answer is 4, not 8.
When main() calls change(), the value of val is passed to the function,
not its address. The function stores the value in the memory location reserved
for its parameter, num. The value at this memory location is changed by the
function. change() but the change has no effect on the value stored in val
because val’s address is not known. This method of parameter handling is
called passing parameters by value.
How can you write a function which can change variables belonging to its
calling function? A variable value can be changed by accessing the variable’s
address to change its value. Variables are accessed by their addresses using
pointers. A pointer is a data type which stores an address. A pointer can be
used like any other data type, therefore you can write a function which accepts a
pointer as a parameter. The following is another version of the example from
the previous section.
117
Functions
In this example, val will have a value of 8 after the last line in main().
The definition of change() includes a pointer to an integer parameter,
instead of the integer parameter itself. When main() calls change the
function creates a copy of val’s address in memory, not its value. The
assignment performed by the function uses the dereference operator, *. Instead
of assigning the value 4 to num, the dereference operator assigns 4 to the
memory location corresponding to val’s address which is stored in the pointer
num. The dereference operator reads the value of its binding identifier as an
address and then represents the value stored at that address.
NOTE
Notice that in the call to change() you specify the address of val with the unary
address operator &. The address operator returns the address in memory which
stores the value of its binding identifier.
However, it is good practice to specify that the function has no parameters with
the void parameter type:
int myFunc(void)
118
Functions
however, you should be careful when including many functions which produce
side effects.
Functions which produce extensive side effects are harder to maintain and
debug, especially for members of a development team. To safely use abstract
functions, you only need to know the data which goes in and comes out – the
function interface. When a function produces side effects, you need to know
about the interface and behaviour to use it safely.
119
11. Complex Data Types
This section introduces several complex C data types. Complex data types
include pointer, arrays, enumerated types, unions, and structures. A solid
understanding of pointers and arrays in particular is absolutely vital to an
effective use of the C language.
11.1 Pointers
The elementary C data types, char, int and float, store values which are
used directly. Unlike these basic types, the pointer data type represents values
used indirectly.
All data stored in computer memory is stored as a series of ones and zeroes. C
data types act as filters which interpret these ones and zeroes. When the
computer evaluates a pointer value, it reads the ones and zeroes as a memory
address. Consider computer memory a single long street and each block of
memory as a building, then a pointer contains a number which identifies a
specific building on the street.
A pointer value can be interpreted as a number just as a real address could.
Because of the pointer’s special nature, the computer knows to interpret that
number as an address in memory.
NOTE
Pointers can be difficult to understand. A pointer contains a numeric value, the
difference is in the way the value is interpreted: as an address in memory.
The declaration of a pointer data type must specify the type of data it can point
to. Consider the following statement which declares a pointer able to point at
any data of type int: int * myIntPtr;
When you declare a pointer, the compiler assigns it the value NULL – this
signifies that it points to no valid address.
A pointer’s data type is important. The computer uses the data type to
determine the size of the memory block the pointer points to. For example, on
121
Complex Data Types
It is possible to use the address operator with a pointer. In these cases, the
address operator returns the memory address where the pointer’s value is
stored. This double indirection is described as a pointer to a pointer which is
also called a handle.
122
Complex Data Types
Why do these last two logical expressions use parentheses? Because of C’s
precedence rules.
In the first case the == equality operator has a higher precedence than the +=
assignment operator, so the parentheses ensure that both assignments are
performed before the equality evaluation.
In the second case the postfix ++ increment operator has a higher precedence
than the * dereference operator. To perform the dereference first, we need to
place parentheses around its sub-expression. Without these parentheses, the
increment operator would increment the pointer instead of the what the pointer
points at! The results of this side effect are not obvious until the next time you
use myIntPtr.
NOTE
It is essential to remember that * and & are operators and that careless use of them
can create bugs which are difficult to locate. Always include parentheses and
comments to facilitate debugging pointer problems.
123
Complex Data Types
The address operator has a higher precedence than the assignment operator.
The address of myInt is returned before the assignment to myIntPtr.
Notice that we do not need to initialize myInt in order to point myIntPtr
at it. The declaration of myInt sets aside a specific memory block for myInt.
11.2 Arrays
The postfix subscript operator, [], is used to refer to an array element. The
operator binds to an identifier which returns an address in memory. The integer
expression inside the square brackets is evaluated and this number determines
how many units of memory should be moved past the bound identifier address
value.
14 Another way to arrange related elements of data is with the struct data type.
124
Complex Data Types
How big is a unit? The size is determined from the data type of the bound
identifier expression. For example, if myIntArray is an array of int then
the expression myIntArray returns the starting address of the block of
memory occupied by the array. The expression myIntArray[2] jumps two
int sized blocks from the address returned by myIntArray.
NOTE
When you declare an array in C you must specify the number of elements it
contains. However, when you subscript an array the number in the brackets
indicates the number of elements past the first element in the array. The first
element in a C array is number 0. This is because the notation myArray[0] is
interpreted as a jump 0 elements past the first element in the myArray memory
block.
125
Complex Data Types
First we declare an array of int values and a pointer to an int. We then set
the pointer, myIntPtr, to point to the first element of the array. Notice how
to set a pointer to point at an array. You can use the expression
myIntPtr = myIntArray; because myIntArray returns the address
of the first array element.
The tricky part of Example 74 is the last statement. Pointer arithmetic allows
us to specify the int sized block of memory next to myIntPtr with the
expression myIntPtr + 1. Since we know that arrays are always stored in
contiguous blocks of memory, it follows that the int sized block of memory
next to myIntArray[0] must be myIntArray[1].
In general, the expressions *(myIntPtr + x) and myIntArray[x] are
equivalent when myIntPtr points to the first member of myIntArray[].
Because the subscript square brackets are an operator, the expressions
myIntPtr[x] and myIntArray[x] are also equivalent. The subscript
operator checks the underlying type of myIntPtr and, finding that it points
to an int, jumps over x int sized blocks.
Be careful! The apparent symmetry between pointers and arrays emerges from
the way their related operators work. Arrays and pointers are not fundamentally
the same. The first two equivalency expressions return 1, but the third may
return either 1 or 0:
*(myIntPtr + x) == myIntArray[x];
myIntPtr[x] == myIntArray[x];
// this may not be true
*(myIntPtr + x) == (myIntArray + x);
Example 75: The relationship between arrays and pointers
An array can contain pointers to other data types. The most common use for an
array of pointers is to use an array of pointers to type char which are pointed to
strings. This technique can be used to send messages to a screen. In the
15 For a useful treatment of array-pointer distinctions see Koenig’s C Traps and Pitfalls.
126
Complex Data Types
following example the array is declared in main but the array is passed to a
function where the values of the pointers are assigned.
void func1(char *p){
p[0]="Press 1 to start";
p[1]="Press 2 to continue";
p[2]="Press 3 to RESET";
p[3]="Press 4 to quit";
}
void main(void){
int val;
char *message[10];
if (val==TRUE){
func1(message);
}
else
message[0]="Status is OK";
}
Example 76: Declaring and initializing an array of pointers
The most flexible complex data types are those you define yourself. C allows
you to construct new data types in terms of those already defined.
The typedef keyword is used to define new data types. You must include an
underlying type for your new type and the name of your new type.
For example, you can create a new type called BYTE using unsigned short
int as the underlying type:
typedef unsigned int UBYTE;
typedef unsigned long UWORD;
UBYTE Var1; // new variable of type UBYTE
UWORD Var2; // new variable of type UWORD
Example 77: Using typedef to define a new data type
With typedef, the name of the new type is in the same location as the
variable name in a simple variable declaration. For example, what new types are
created with the following declaration?
struct coord_tag {
int xVal;
int yVal;
127
Complex Data Types
};
typedef struct coord_tag COORD;
COORD and LOC are the new types. In this case, coord_tag and
location_tag are the tags for the new structures. Tags are discussed in
the next section. Example 78 shows two different techniques for using
typedef with struct.
Once you have defined a new type using typedef, it can be used like any C
data type.
"# You can use sizeof() to retrieve memory size requirements:
byteSize = sizeof(UBYTE);
structureSize = sizeof(COORD);
The most straightforward complex data type is the enumerated data type,
declared as type enum. The enum type is used to represent a set of possible
values. The traditional example for this type is the days of the week:
enum WEEK { Su, Mo, Tu, We, Th, Fr, Sa } dayOfWeek;
This declaration creates an enumerated type called WEEK, provides seven
possible values, and declares a variable called dayOfWeek of this new
enumerated type. You can also separate this process into two declarations:
enum WEEK { Mo, Tu, We, Th, Fr, Sa, Su };
enum WEEK dayOfWeek;
128
Complex Data Types
The label WEEK is not a new type, it is called a tag. The second line of code in
the previous example requires the enum keyword for the declaration of
dayOfWeek. To use WEEK as a user defined data type you require a
declaration such as:
typedef enum { Mo, Tu, We, Th, Fr, Sa, Su } WEEK;
You can declare the enumerated variable dayOfWeek on a single line. Since
the enumerated list tag WEEK represents the list itself we do not need to include
it in the declaration.
enum { Su, Mo, Tu, We, Th, Fr, Sa } dayOfWeek;
The tag is useful as it can represent a list of enumerated elements to declare
more than one variable of that type.
enum WEEK { Su, Mo, Tu, We, Th, Fr, Sa } dayOfWeek;
enum WEEK dayOFWeek;
enum WEEK payDay = Th;
enum WEEK groceryDay = Sa;
Example 79: Declaring multiple variables of the same enumerated type
129
Complex Data Types
NOTE
Ensure that enumerated variables have the values you expect them to have by
performing your own boundary checking.
By default, the compiler supplies a range of integer values beginning with 0 for
any list of enumerated elements. This default behaviour can be modified in two
ways:
1) Specify values for each enumerated element. The following example is from
the COP8SAA7 WATCHDOG service register WDSVR. Bits 6 and 7 of
this register select an upper limit to the service window which selects
WATCHDOG service time.
enum WDWinSel { Bit7 = 7,
Bit6 = 6};
Example 82: Specifying integer values for enumerated elements
NOTE
Since character constants are stored as integer values they can be specified as
values in an enumerated list. enum DIGITS {one=‘1’, two=‘2’,
three=‘3’}; will store the appropriate integer values of machine character set
(usually ASCII) for each digit specified in the element list.
130
Complex Data Types
11.5 Structures
131
Complex Data Types
If you create a structure which is used several times in your program or you are
using more than one kind of structure, it is good practice to create structure
types using typedef.
typedef struct Display_tag {
int DisplaySelected;
int hundreds;
int tens;
int ones;
char AorP;
}DISPLAY;
DISPLAY currentTime;
DISPLAY alarmTime;
Example 87: Using typedef to clarify structure declaration
Remember that you can declare a pointer to a struct before the struct
itself is declared. The example declares a new structure type called DISPLAY.
The use of typedef helps at other points in the program when you need a
structure instance.
C includes two binary operators which allow access to structure members: the
dot operator, ., and the structure pointer operator, ->. In each case, the
binding identifier to the left of the operator indicates the structure and the
binding identifier to the right of the operator indicates the element within that
structure.
Once a struct variable is declared you can use the dot operator to reference
an element of the structure. The following assigns values to the elements of the
currentTime variable for the structure defined in Example 87.
currentTime.DisplaySelected = 1;
currentTime.hundreds = 9;
currentTime.AorP= ”A”;
alarmTime.AorP = currentTime.AorP;
Example 88: Accessing elements in a structure
132
Complex Data Types
Structures are often manipulated using pointers. C has an operator specially for
this purpose; the structure pointer operator. In order to use a pointer to access
members of a structure the pointer must first be pointed at the structure
instance. The following example points Display_Ptr to alarmTime and
then accesses the elements of alarmTime.
struct Display_tag * Display_Ptr;
struct Display_tag {
int DisplaySelected;
int hundreds;
int tens;
int ones;
char AorP;
}alarmTime;
Using bit fields allows the declaration of a structure which takes up the
minimum amount of space. A bit field contains a specified number of bits, it is
a member of a structure and is accessed like any other structure member. The
following example for the Motorola MC68HC705C8 defines the Timer Control
Register (TCR) bits as bit fields in the structure called TCR.
133
Complex Data Types
struct reg_tag {
int ICIE : 1; // field ICIE 1 bit long
int OCIE : 1; // field OCIE 1 bit long
int notUsed : 3 = 0; //notUsed is 3 bits and set to 0
int IEDG : 1; // field IEDG 1 bit long
int OLVL : 1; // field OLVL 1 bit long
} TCR;
Example 90: Bit fields in structures
Storage of bit fields in memory varies from one compiler to another. Some
compilers cannot store a bit field over a word boundary. In this case the
following structure would place the second field entirely in a separate word of
memory from the first:
struct {
unsigned int shortElement : 1; // 1 bit in size
unsigned int longElement : 8; // 8 bits in size
} myBitField;
Example 92: Compiler dependant storage of bit fields
The order in which the compiler stores elements in a structure bit field also
varies from compiler to compiler. Some compilers may use the first word of
allocated memory to hold longElement in the previous structure. Other
compilers may use the first word to contain shortElement and part or
none of longElement.
Bit field elements behave exactly as an int of the same size. Thus an element
occupying a single bit could have an integer value of either 0 or 1, while an
element occupying two bits could have any integer value ranging from 0 to 3.
134
Complex Data Types
You can use each field in calculations and expressions exactly as you would an
int.
11.6 Unions
The format of union resembles that of the structure. You can identify a union
with a tag name. To make your union a data type you must use the typedef
keyword. In the following example, a new type called share is created.
typedef union share_tage {
int asInt;
char asChar;
} share_type; //share_type is the data type
share_type share; //share is the variable name
Example 94: Using typedef to declare a union
135
Complex Data Types
union tagName {
int asInt;
char asChar;
short asShort;
long asLong;
int near * asNPtr;
int far * asFPtr;
struct hilo_tag asWord;
} scratchPad; //scratchPad is the variable name
Example 95: Using a union to create a scratch pad
As with structures, union elements are accessed with the dot and right arrow
operators. Use the dot operator to specify an element by placing it after the
name given to the union. In the following example, the data in the
scratchPad memory block is interpreted as a char.
scratchPad.asChar = ‘b’; //assign b to scratchPad
tempChar = scratchPad.asChar; //retrieve as character
Example 97: Accessing a union element with the dot operator
If you indicate the union with a pointer, use the right arrow operator to specify
an element. In the following example, scratchPad is interpreted as an int.
union tagName * scratchPad_ptr; //declare pointer type
scratchPadPtr = &scratchPad; //point to scratchPad
someInt = scratchPad_ptr->asInt; //retrieve as integer
Example 98: Using the right arrow operator to access a union member
136
Complex Data Types
Since the compiler uses a single block of memory for the entire union, it
allocates a block large enough for the largest element in the union. For example,
the compiler will allocate a 16 bit block for the union scratchPad in
Example 98 because the elements asLong and asFPtr require 16 bits16.
The compiler will align the first bit of each element in the memory block. If you
assign a 16 bit value to scratchPad and then read it as an 8 bit value, the
compiler will return the first 8 bits of the data stored.
NOTE
Verify your target hardware’s method for storing 16 bit integer values. Some
hardware stores long data with a higher address for the low byte. This is called big
endian because the “big end” comes at the end. Other hardware stores the high
byte at the higher address. This is called little endian because the “little end” comes
last. The results returned from extracting 8 bits from a 16 bit value will differ
depending on the hardware storage method.
The scratchPad variable can handle the 16 bit value as a word and can
provide access through a structure to either byte in the word. This is useful so
you can use the asWord element to return a specific part of the word.
scratchPad.asLong = someLong;
someInt = scratchPad.asWord.lowByte;
Example 99: Returning the low Byte of a word
Notice that the scratchPad example assumes the target hardware is big
endian (high byte last). For a little endian target (low byte last), the asWord
element needs to be defined as follows17. Notice that redefinition does not
affect the statements in the previous example.
struct hilo_tag {
short highByte;
short lowByte;
} asWord;
Example 100: Returning a specific part of a word for little endian
17To promote even greater portability and clarity define a new data type called BYTE
based on the underlying 8 bit data type on the target hardware.
137
Complex Data Types
someUnion.asFloat = someFloat;
someInt = someUnion.asInt;
Example 101: Incompatible variables with different storage methods in unions
138
12. Storage and Data Type Modifiers
C provides the capability to further specify how stored values should be
interpreted with the use of storage class and data type modifiers. Many of
these modifiers have been introduced briefly in other sections of this book.
Both storage class and data type modifiers are keywords which are included in a
variable or function data type declaration.
139
Storage and Data Type Modifiers
NOTE
Unlike other internal linkage objects, static local variables need not be unique within
the compilation unit. They must be unique within the statement block which contains
their scope.
Objects with internal linkage typically occur less frequently than objects with
external or no linkage.
12.1.3 No linkage
An identifier with external linkage can be used at any point within a program as
long as it is visible. Suppose the function int Calculate_Sum() is
declared in a source file. If you want to use this function in any other
compilation unit, you must tell the compiler where to look for the function
140
Storage and Data Type Modifiers
Like functions, global variables have external linkage. To use a global variable in
more than one source file, you must declare it as extern:
extern int myGlobalInt;
141
Storage and Data Type Modifiers
Preprocessor directives are placed at the top of the file to handle the EXT tag in
each definition:
#ifdef MAIN
#define EXT “ ”
#else
#define EXT “extern”
#endif
Example 103: Using preprocessor directives to declare extern global variables
By default, all functions and variables declared in global space have external
linkage and are visible to the entire program. Sometimes you require variables
or functions which have internal linkage: they are visible within a single
compilation unit. Use the static keyword to restrict the scope of variables:
static int myGlobalInt;
static int staticFunc(void);
Example 104: Using the static data modifier to restrict the scope of variables
These declarations create global identifiers which are not accessible by any
other compilation unit. Any function within the same compilation unit as the
static variable declarations can access these identifiers.
The static keyword can be used to create permanent local variables. For
example, consider the task of tracking the number of times a recursive function
calls itself (the function’s depth). You can accomplish this using a static
variable:
142
Storage and Data Type Modifiers
int myRecurseFunc(void) {
static int depthCount=1;
depthCount += 1;
if ( (depthCount > 10) || (!DONE) ) {
myRecurseFunc();
}
}
Example 105: Using static variables to track function depth
143
Storage and Data Type Modifiers
NOTE
If you use register ensure that the code for the variable declaration is close to the
code where the variable is used. This minimizes the overhead expense of
dedicating a register for storage of a single particular variable.
The auto keyword denotes a temporary variable. You may only use auto
with variables because C does not support functions with local scope. Since all
variables declared inside a statement block have no linkage by default, the only
reason to use the auto keyword is for clarity:
int someFunc(NODEPTR myNodePtr) {
extern NODEPTR TheStructureRoot;
// global pointer to data structure root
auto NODEPTR tempNodePtr;
// temporary pointer for structure manipulation
...
}
Example 107: Using the auto data modifier
144
Storage and Data Type Modifiers
Data type modifiers alter the way information is recorded and retrieved. Type
modifiers extend the basic data types available. Type modifiers apply to data
only, not to functions. You can use them with variables, parameters, and
returned data from functions.
Some type modifiers can be use with any data while others are used with
specific types of data such as pointers.
const
Sometimes you want to create variables with unchangeable values. For example,
if your code makes use of *, the constant PI, then you should place an
approximation of the value in a constant variable:
const float PI = 3.1415926;
When your program is compiled, the compiler allocates ROM space for your
PI variable and will not allow the value to be changed in your code. For
example, the following assignment would produce an error at compile time:
PI = 3.0;
volatile
Volatile variables are variables whose values may change without a direct
instruction. For example, a variable which contains data received from a port
will change as the port value changes.
Using the volatile keyword informs the compiler that it can not depend
upon the value of a variable and should not perform any optimizations based
on assigned values.
145
Storage and Data Type Modifiers
You can direct the compiler to permit integer data types to contain negative as
well as positive values. You can also restrict integer data types to positive values
only. The sign value of an integer data type is assigned with the signed and
unsigned keywords.
signed
The signed keyword forces the compiler to use the high bit of an integer
variable as a sign bit. If the sign bit is set with the value 1 then the rest of the
variable is interpreted as a negative value. By default, short, int and long
data types are signed and the signed keyword need not be used. The char
data type is unsigned by default. To create a signed char variable you must use
a declaration such as:
signed char mySignedChar;
If you use the signed keyword by itself the compiler assumes that you are
declaring an integer value. Since int values are signed by default, programmers
rarely use the syntax: signed mySignedInt;.
unsigned
To create unsigned short, int, or long data types use the unsigned
keyword. You need never use the keyword with char values because they are
unsigned by default. This keyword forces the computer to read the high bit
as part of the variable value:
unsigned int myUnsignedInt;
If you use the unsigned keyword alone the compiler assumes the variable
you are declaring is an int. C programmers often use the following syntax:
unsigned myUnsignedInt;
The short and long modifiers instruct the compiler how much space to
allocate for an int variable. The resulting variable is interpreted as an int, but
the number of bits used to store the variable value may change.
146
Storage and Data Type Modifiers
short
The short keyword declares an int of the same size as a char variable:
usually 8 bits:
short int myShortInt;
On microcontrollers where the natural machine unit is the byte a short int
is usually the same size as an int. Some compilers allow two byte int
variables. In these cases, the short int remains 8 bits in size.
If you use the short keyword alone, the compiler assumes the variable is a
short int type:
short myShortInt;
long
The long keyword declares an int twice as long as a normal int variable:
long int myLongInt;
On some computers a long is not twice the size of an int. However, long
will always be the same size or larger than int and short will always be the
same size or smaller than int.
On microcontrollers a long int occupies two bytes. If the compiler allows
you to use 16 bit int data types, the long and int are usually the same size
because of the fact that long data types always occupy two bytes.
The near and far keywords are common extensions to standard C. They
allow different size pointers to address different areas of computer memory.
near
The near keyword creates a pointer which points to objects in the bottom
section of addressable memory. These pointers occupy a single byte of memory,
and the number of memory locations to which they can point is limited to the
first 256 locations, or from $0000 to $00FF.
int near * myNIntptr;
147
Storage and Data Type Modifiers
For efficient RAM access, most microcontrollers place user RAM in the low
memory addresses. Thus, near pointers usually point to data stored in user
RAM such as user defined variables.
far
The far keyword creates a pointer which can point to any data in memory:
int far * myFIntptr;
These pointers take two bytes of memory which allows them to hold any legal
address location from $0000 to $FFFF. far pointers usually point to objects
in user ROM, such as user defined functions and constant variables.
Since the implementation of near and far pointers varies from target to
target the default method of creating pointers also varies. For example, what
kinds of pointers do the following two declarations generate?
int * myIntPtr;
const int * myConstIntPtr;
On most target machines, the compiler generates a near pointer for the first
declaration and a far pointer for the second. Since the compiler knows that
const int data is stored in ROM it knows a far pointer is needed.
148
Storage and Data Type Modifiers
If you use pointers extensively you must know the default pointer type. Many
embedded developers do not use pointers extensively as they are very CPU
intensive. This is especially true with the far pointer double byte values.
149
13. The C Preprocessor
Every C language environment has a preprocessor. As the name suggests, the
preprocessor examines program code before it is processed by the compiler.
The preprocessor reads a source code file line by line and performs the
preprocessor directives it finds.
The preprocessor does not understand the C language. This can be a source of
great trouble for program developers as it is easy to miss problems caused by
passing the preprocessor invalid commands. Two common errors are including
a semicolon to terminate a macro definition and placing a comment on the
same line as a directive. Since the preprocessor does not understand the C
interpretation of semicolons or comments it will attempt to read these things as
part of the directive.
Some C environments support an option which invokes only the preprocessor
for a source file. This has the advantage of letting you look at the preprocessor
results before the source gets passed to the compiler.
Any source code line that begins with the hash character, #, is a command to
the preprocessor and is called a preprocessor directive. It is good practice to
justify these directives against the left hand margin to distinguish them from
your C code. Historically, pre-ANSI compilers required preprocessor directives
to begin in column one of a source code line. This practice should not be
followed when you nest directives:
#if DEBUG
#include <debug.h>
#endif
Example 109: Nesting preprocessor directives
151
The C Preprocessor
Unlike the C compiler, white space is very important to the preprocessor. For
example, in C both the following function definitions are acceptable:
int smallest (int arg1, int arg2);
int largest(int arg1, int arg2);
The preprocessor is not so forgiving. Only one of the following two macros
performs as expected:
#define SMALLEST (arg1,arg2) ((arg1)<(arg2)?(arg1):(arg2))
#define LARGEST(arg1,arg2) ((arg1)<(arg2)?(arg1):(arg2))
SMALLEST is defined as an object macro or symbolic constant, not as a function
macro like LARGEST as intended. Thus a call to SMALLEST will be expanded
by the preprocessor into the monstrosity:
(arg1,arg2)((arg1)<(arg2)?(arg1):(arg2))(oneInt,twoInt);
When the preprocessor sees this directive it will look for the file machine.h
and replace the directive with the contents of machine.h. The preprocessor
will then continue searching through source code. The next line it will look at
will be the first line of the machine.h file.
If the preprocessor cannot find the specified file, it will give an error and quit
processing. Where does the preprocessor look for the file?
152
The C Preprocessor
<filename.h>
If you surround the file name with angle brackets the preprocessor will look for
the file in a system dependent location determined by the compiler you are
using.
In general, angle brackets produce two types of searching. On some systems,
the preprocessor will look through a directory or list of directories you have
specified as containing the library and header files for your compiler. On other
systems the preprocessor will look through a directory or list of directories
specified in the operating system environment as a location for commands.
“filename.h”
If you surround the file name with double quotes, the preprocessor behaviour
is more complex.
1) The preprocessor looks for the file in a system dependent location. This may
be the same location used for <> inclusion; however, it usually is not. If
the preprocessor searches for include files in a single location, the
preprocessor does not support “” inclusion and treats it as <> inclusion.
2) If the file is not found, the preprocessor will retry the directive as if the file
were surrounded by angle brackets.
In general practice, the double quotes signal the preprocessor to look for the
file in the same place as the source code file containing the directive.
NOTE
If the preprocessor can not find the file in the place for “” inclusion it will reprocess
the directive as if it used <> inclusion syntax.
The common misconception that “” inclusion refers to the current directory can lead
to errors. You must check your compiler documentation to determine exactly where
and how “” and <> inclusion look for files.
153
The C Preprocessor
You can see that the symbol might convey more meaning in the code than its
value alone.
You may want to redefine the value of a symbolic constant. The preprocessor
may give an error if you attempt to define a symbol that is already defined.
154
The C Preprocessor
According to the ANSI standard you can redefine a symbolic constant with a
replacement string which is exactly similar. Despite this, it is best to be
scrupulous about using #undef for symbols before you redefine them.
You must tell the preprocessor to remove the symbol from its list before you
can redefine it.
#undef MAXINT
#define MAXINT +127
Example 110: Redefining a constant using #undef
Suppose you have a small set of functions that you want to keep 8 bit portable,
while allowing remaining functions to use 16 bit int values. The following
directives would be used:
1) Define MAXINT for 16 bit #define MAXINT +32768
2) Undefine MAXINT #undef MAXINT
3) Define MAXINT for the 8 bit #define MAXINT +127
Undefining a symbol has no effect if a symbol is not defined, the preprocessor
simply ignores the #undef directive.
This directive instructs the preprocessor to place the symbol 8BITINT into its
symbol list with no associated value. If you use the symbol in your code the
preprocessor replaces it with nothing. This can easily lead to compiler errors.
155
The C Preprocessor
// program code
someInt = SMALLEST(oneValue, twoValue);
Example 111: Defining and calling a macro
NOTE
Because a function macro looks similar to a function call it can be difficult to tell
macro functions and regular functions apart. It is good coding practice to use upper
case for all macro names so they are easily distinguished from functions code.
156
The C Preprocessor
NOTE
Using any expression that causes side effects as an argument to a macro or a
function call is not good practice and can cause unexpected results.
To expand macro parameters inside quotes you need to use the # and ##
operators
The #if and #endif directives include code when the #if expression
evaluates to a non-zero integer value:
#define DEBUG 1
#if DEBUG
#include <debug.h>
#endif
Example 112: Using #if and #endif to conditionally compiler code
Blocks of code such as that in Example 112 are often used to produce both a
debugging and final version of a program. The first line defines the DEBUG
symbol with the value 1. The #if directive tests its argument expression to see
if it has a non-zero constant integer value. When DEBUG has a non-zero value,
157
The C Preprocessor
the preprocessor will #include a header file created for debugging called
debug.h.
Because #if accepts an expression as an argument, you can also do the
followings to check for the value assigned a symbolic constant:
#define DEBUG_STATE 1
#if DEBUG_STATE == 1
#include <debug1.h>
#endif
Example 113: Using expressions in #if directives for conditional compilation
The constant integer expression tested by #if cannot contain the sizeof()
function, type casts, or enum constants. However, you can use the
defined() function with #if directives. The defined() function
returns 1 if its argument is a defined symbol. If the symbol is not defined, it
returns 0. Therefore, we can rewrite Example 113 as follows:
#define DEBUG
#if defined(DEBUG)
#include <debug.h>
#endif
Example 114: Using the defined() function for conditional compilation
You can also use !defined() to test if a symbol has not been defined. It will
return 1 if its argument is not a defined symbol and 0 if the argument is defined:
#if !defined(DEBUG)
#include <machine.h>
#endif
Example 115: Using !defined() to test if a symbol has not been defined
158
The C Preprocessor
#define DEBUG 1
#if DEBUG == 1
#include <debug.h>
#else
#include <machine.h>
#endif
Example 116: Using #else and #elif to choose between compilation blocks
The #error directive halts the preprocessor and produces a specified error
message. Most compilers provide additional information with your message,
159
The C Preprocessor
such as the name of the source file and the position of the error directive within
that file:
#if STATE == DEBUG
#include <debug.h>
#elif STATE == RELEASE
#include <machine.h>
#else
#error Bad or missing STATE value: need DEBUG or RELEASE
#endif
Example 119: Using the #error directive
160
14. Libraries
Technically, a library in C is simply a collection of C functions. Libraries usually
contain functions which serve a common purpose, such as interfacing to an
LCD, using a timer, providing mathematical capabilities, or converting data
types. The functions within a library are a collection of the basic operations
defined by the scope of the library. For instance a math library would contain
routines for multiplication, division, and modulus.
Because high level languages are very portable, libraries written in high level
languages are also very portable. Portability is made possible by the
standardization of high level languages such as C. C language code written on a
PC will compile and run on MAC or UNIX machines often with little or no
alternation. Similarly, C code written for a specific 8 bit microcontroller can be
compiled and run on a different microcontroller with very minor changes to the
code.
Although libraries for math and data type conversion are useful, they are not
the libraries most useful in embedded systems development. By definition a
microcontroller embedded within a system needs to receive data in and sends
data out. This is most often done with devices such as keyboards, LCD
displays, serial interfaces, and I/O ports. At times it is necessary to convert this
data to a specific format so that it can be understood. Devices such as Analog
to Digital and Digital to Analog converters provide such conversion
capabilities. Libraries which support peripheral devices are very useful in
embedded systems development.
161
Libraries
2) The libraries have been thoroughly tested and debugged allowing faster
hardware/software integration
3) The embedded programmer does not need to know the low level hardware
details of how the device operates.
4) Support for multi-controller systems which use microcontrollers from
different families. C source code can be ported between families by
changing the included header file. This saves the embedded programmer
from having to learn implementations on different microcontroller.
5) Software reusability is maximized.
Some useful portable libraries would provide routines for:
1) SPI (Serial Peripheral Interface)
2) Microwire
3) SCI (Serial Communications Interface)
4) UART (Universal Asynchronous Receiver Transmitter)
5) USART (Universal Synchronous Asynchronous Receiver
Transmitter)
6) Analog to Digital conversion and Digital to Analog Conversion
7) I/O ports
8) LCD displays
9) PWM (Pulse Width Modulation)
10) Timers
Suppose you have been given the task of implementing a SPI serial interface
between a Microchip PIC16C74, National COP8SAA and a Motorola
68HC05C8. You have only programmed for the Microchip PIC and you are not
familiar with SCI serial interfaces. You could learn how SPI works, find out
how it is implemented on the different chips, learn how to code for the
different chips, write drivers for each chip, and then finally debug the hardware.
This development process could take a very long time! By drawing on a
portable library for the SPI you can write C code using library functions and
avoid delays in project development.
162
Libraries
char SPI_in[REC_SIZE];
const char o[] = {0b10000001,0b10000010,0b01000100,
0b00001000, 0b00010000};
void main(void){
SPI_array_get(SPI_in); //set the array to store data in
SPI_array_send(o); //sets the array to send data from
//The following statement configures the SPI
//The argument that is passed depends on the desired
//configuration. The instructions on how to set this
//are found in the device driver headers
SPI_set_master(0b00100000);
SPI_flush(); //send a byte to get everything synched
SPI_send_rec(0,4); //initiate the send/receive function
while(1){
}
}
Example 120: Master function for PIC16C74 SPI communication
The master source code in Example 120 can be compiled for different chips
with very minor changes and the library calls would work as expected.
163
Libraries
The library calls are those which begin with the letters SPI such as
SPI_array_get(SPI_in), SPI_array_send(o);,
SPI_set_master(0b00100000);, SPI_flush(); and
SPI_send_rec(0,4);. We will now examine some of these functions in
detail by looking at excerpts from specific device libraries.
14.2.2 SPI_set_master(ARGUMENT);
This function configures the SPI. The following sections describe how it is
implemented in the libraries for the individual chips.
164
Libraries
WCOL<7>
"# 1 The SSPBUF register is written while transmitting the previous
word. Must be cleared in software.
"# 0 No Collision
/*=========================================
This function configures the SPI and sets up the proper
pins for serial port operation.
ARGUMENTS:
temp, The byte to set the SPI
======================================================*/
165
Libraries
void master(void){
PORTGC.4 =1;
PORTGC.5 =1;
PORTGC.6 = 0;
PORTGD.6 = 1;
CNTRL.MSEL = 1;
}
//an alias to create a uniform library
#define SPI_set_master(ARG) MW_set_master(ARG)
Example 123: Setting up SPI on the National COP8SAA7
14.2.3 SPI_send_rec(0,4);
This function initiates the send/receive function. The following sections show
the device specific functions. The function starts at array index 0 of the receive
and transmit arrays and transfers information up to index 4. With SPI
information is received and transmitted at the same time.
ARGUMENTS:
ARG2 is a pointer to the array or data you wish to send
ARG3 n is the array index to start from
ARG4 offset is the array index to go up to
================================================*/
166
Libraries
#define SIOP_send_rec(ARG2,ARG3,ARG4)\
SPI_array_send(ARG2); \
SPI_send_rec2(ARG3, ARG4);
while(n != offset){
SDR = *ARRAY_SEND;//
SPI_out[n]; // load the SSPBUFF
//SPIF flag indicates transmission is done
while(SSR.SPIF == 0){
}
*(ARRAY_GET+n) = SDR; //store the returned byte
ARRAY_SEND = ARRAY_SEND + 1 ;
n=n+1;
}
}
//note the use of an alias!
#define SPI_send_rec(ARG2,ARG3,ARG4) \
SIOP_send_rec(ARG2,ARG3,ARG4)
Example 125: Initiating SPI send/receive on the Motorola 68HC705C8
167
Libraries
As we can see from the individual functions, the library prevents the user from
having to know the specific hardware configuration of each processor. In
particular, the use of aliases allows the user to refer to the functions in the most
familiar way possible. One user might be most familiar with the Microchip PIC
and wish to refer to the functions as SPI. However, another user might be most
familiar with the National COP8 and wish to refer to the functions as MW
(Microwire).
168
15. Sample Project
This section covers a sample embedded system project. The project interfaces a
microcontroller with a SPI (UART) peripheral to a PC via the RS-232 port. The
most common and easiest technique for interfacing to a PC is to use the parallel
port where there are eight parallel bits for input and output. However, it is very
easy to damage the parallel port. On PCs with the parallel port on the
motherboard a damaged parallel port can require a new motherboard.
The serial port is more complicated but it is a much better tool for interfacing
to a desktop PC. It is very difficult to damage your computer by manipulating
the serial port. Also, the hardware is almost universally standard. Once you
build an embedded system with RS-232 support you can hook it up to a PC,
MAC, or another embedded system merely by changing the interface software
This project will introduce some key embedded system programming concepts
such as interrupts, registers, and peripherals.
The project uses the portable device driver libraries discussed in Section 14,
Libraries. The specific hardware implementation will be on a Microchip PIC
16C74. The code is written using Borland C functions. If you do not use
Borland these functions are most likely supported by your favourite compiler,
where they may have slightly different names.
The concepts and terminology necessary for this project are discussed in the
following sections.
15.2.1 Asynchronous
Devices that are synchronized in the electronics world use the same clock and
their timing is in synchronization with each other. Things that are asynchronous
have their own timing and clocks. In the world of serial communications it is
easy to tell if something in synchronous or not: if there is a clock line it is
synchronous, if there is no clock line it is asynchronous.
169
Sample Project
15.2.2 SCI
15.2.3 RS-232
IDLE Start Bit 0 Bit 1 Bit 2 Bit 3 Bit 4 Bit 5 Bit 6 Bit 7 Stop
Figure 10: RS-232 signal
An idle serial line going from high to low is a signal to start receiving data. By
using the baud rate, the receiver knows exactly how long each bit will be, so it
can distinguish bits from each other. After 8 bits are received the line goes high
again and the receiver waits for the next start bit. After a byte has been received,
it can be taken from the serial port receive register and used by the computer.
The transmitter hardware handles the start and stop bits. Usually all we have to
do is load up the serial port transmit register and wait for an interrupt or use
device polling to determine when the transmission is complete.
170
Sample Project
The PIC 16C74 contains a hardware SPI port that lets us transmit
asynchronous serial data. We can be notified by interrupt or by polling when
the chip has finished sending or receiving a byte. We will examine the serial port
on the PC in detail.
The next time your PC boots, examine the screen which contains the BIOS
information. The BIOS will tell you what serial ports you have and will display
their hex address. The column “IRQ” in Table 10 is the name of the interrupt
that is associated with the port.
171
Sample Project
Depending on the vintage of your computer you may have any one of the
following UART chips:
CHIP COMMENTS
8250,8250A, 8250B These were the first UARTS.
16450, 165501, These are what the majority of you will have. 16450 was
6550A used in AT’s but is still quite common. The 16550 had
some problems and was replaced by the 16650A which
has a 16 byte FIFO
16650 The newest UART.
Table 11: UART chips
What is a FIFO?
A FIFO is a buffer. FIFO stands for First In First Out. A UART with a FIFO
can store data and therefore does not have to interrupt the CPU as often
because it can transfer many bytes at each interrupt service.
The variety of UART chips does not affect software development a great deal.
The UART chips are all supersets of previous UARTs. Unless you are
interested in super high performance communications, you can program these
chips in exactly the same way. Of course, if you run code for FIFO chips on
FIFOless chips the FIFO will not be working. For reasons of simplicity and
portability the code in this book will not use a FIFO.
15.4.3 IRQ
Everything you know about interrupts from embedded systems holds true for
larger computers. However, the memory address range is much bigger so
vectors will be several bytes.
The original PC was designed with 256 interrupt vectors for both hardware and
software. These were each 4 bytes in length for a total of 1024 (256 ' 4) bytes
in memory. As a whole this areas of memory is called the interrupt vector table.
For example, INT 0 uses memory locations 0x00000, 0x00001, 0x00002 and
0x00003 while INT8 uses the four bytes at 0x0020, 0x0021, 0x0022 and 0x0023.
Eight hardware interrupts beginning at INT8 are reserved. They are called
IRQ0-IRQ7, thus IRQ0 corresponds to INT8, IRQ1 to INT9 and so on.
172
Sample Project
Now that we know about the vector table we have to examine a few other
registers:
Address Read/Write Abbreviation Name
BASE +
0 (DLAB = 0) W Transmit Holding Buffer
0 (DLAB = 0) R Receiver Buffer
0 (DLAB = 1) R/W Divisor Latch Low Byte
1 (DLAB = 0) R/W Divisor Latch High Byte
1 (DLAB = 1) R/W IER Interrupt Enable Register
2 R IIR Interrupt Identification
Register
2 W FCR FIFO Control Register
3 R/W LCR Line Control Register
4 R/W MCR Modem Control Register
5 R/W LSR Line Status Register
6 R MSR Modem Status Register
7 R/W Scratch Register
Table 12: COM port registers
The table depicts the registers associated with each COM port. The registers are
located at the base port address plus an offset. For example, the Line Status
Register for COM1 is at 0x03FD = (0x03F8 + 5). The DLAB bit is similar to a
paging bit, it allows the access of different registers at the same address. For
example, to access the IER set the DLAB bit and access BASE +1.
The following paragraphs describe each register:
173
Sample Project
the Divisor Latch Low/High registers and the baud rate is changed to 115200
/ Divisor. For example, for a 2400 BAUD rate, we want the divisor to be 48
(115200/2400 = 48). We write 48 (or 0x30) into these two registers by placing
0x00 in the high byte register and 0x30 in the low byte register.
174
Sample Project
175
Sample Project
176
Sample Project
The serial port has two associated IRQs, IRQ3 and IRQ4. To refer to these in a
program we must refer to them by interrupt vector table entry: 0x0B for IRQ3
and 0x0C for IRQ4. (IRQ0 is located at 0x08). There are two useful macros
provided by Borland called enable() and disable(). These functions
enable and disable all interrupts which is useful for when we are carrying out
interrupt related programming and do not wish our program to be interrupted
by other interrupt service requests.
177
Sample Project
Good programming practice dictates that we return the contents of the vector
location when our program is finished. If we do not, programs which use the
serial port may not run because they will be directed to our interrupt service
routine.
Programming interrupts on a PC is much like programming them on an
embedded system. There is a definite series of steps one must use:
178
Sample Project
NOTE
You may encounter problems attempting to perform some operations in an interrupt
service routine. For instance if you try to write data to disk, the system may hang
because the disk drive is trying to use an interrupt but is unable to do so.
#include "16c74.h"
#include "port.mpc"
#include "MPCsci2.h"
void main(void){
179
Sample Project
while(1){
}
}
void __INT(void){
INTCON.GIE = 0;
if(PIR1.TXIF == 1){
SCI_int_svcst(); // macro from the device library
}
RestoreContext;
return;
}
Example 127: Serial port connection example for the PIC16C74
15.6.2 PC Code
#include <dos.h>
#include <stdio.h>
#include <conio.h>
#include <string.h>
#define PORT1 0x03f8 //we want to use COM 1
#define COM1 0x03f8
#define IRQ 0x0C //the INT for COM1 (IRQ3)
outportb(0x20,0x20);
//if the first bit of the LSR is set, it means
//that the UART has received information
do { c = inportb(PORT1 + 5);
if (c & 1){ //so we get it!
chin[gn] = inportb(PORT1);
printf("%c", chin[gn]); // print message
180
Sample Project
}
}while (c == 1);
int_done = 1;
gn = gn + 1;
enable(); // turn back on the interrupts
}
int main(void){
int n = 0;
outportb(PORT1 + 1 , 0); //Turn off COM1 interrupts
disable(); //Borland macro to turn off all interrupts
oldfunc = getvect(IRQ); //store old interrupt vector
setvect(IRQ, int_svc ); // Set new interrupt vector
//Communication Settings
outportb(PORT1 + 3 , 0x80); // SET DLAB ON
outportb(PORT1 + 0 , 0x0C); // Set Baud rate Low Byte
outportb(PORT1 + 1 , 0x00); // Set Baud rate Hi Byte
outportb(PORT1 + 3, 0x00); // The DLAB is zero
181
Sample Project
Now all that we require is the hardware necessary to turn the electrical signals
from the PIC into RS-232 levels. This is quite easy and there are a number of
chips that can turn 5 volt TTL levels into RS-232 levels off of a standard 5 volt
power supply. They use a “bucket brigade” of capacitors to build the needed
potential difference. This project uses a MAX232A, but you can use any of the
many alternatives as long as you follow the schematic:
P1 CONNECTOR DB9
5 9 4 8 3 7 2 6 1
VCC
R1
10
C4
0.1
C5
U1
0.1 16
13 12
R1IN R1OUT RX - to MCU
8 9
R2IN VCC R2OUT
TX - to MCU
11 14
10 T1IN T1OUT 7
T2IN T2OUT
1
3 C1+
C1 4 C1-
5 C2+
C2 2 C2-
6 V+
0.1 V- GND
0.1 C3 15
MAX232A
0.1
182
Sample Project
There are two types of serial connectors: the DB9 has 9 pins and the DB25 has
25 pins. The pins are usually marked on the connector so it is easily determined
which pins are which. The interface to the microcontroller can be thought of as
a DCE, or Data Communications Equipment, and therefore needs only a
straight through cable.
The RS-232 protocol defines two types of devices, DTE, Data Terminal
Equipment and DCE, Data Communication Equipment. DTE is generally used
with PCs and DCE is usually found on modems. The pins, DTR, DSR, CD,
RTS, and CTS are only useful with a modem, i.e. connecting a DTE to a DCE.
We will just loop these pins back and trick the PC into thinking it is talking to a
modem. This way data can flow freely on the TX and RX pins.
NOTE
Ensure that you connect the ground on both parts of the circuit together or it will not
work because the electrical signals will not be able to complete a circuit.
You can make many interesting and fun projects using the serial port. The
project described in this section can be very useful even if you do not have any
micro controllers to interface with. It is very useful for learning key embedded
programming concepts like interrupts, registers, serial communications, and
timing. You can easily hook up two PCs to transfer files and run dumb
terminals. If you connect two PC’s together remember, that you are connecting
two DTEs, which will require a null modem cable.
183
16. C Precedence Rules
Expression type Operators
Primary Identifier
Constant
String
Expression
Postfix a[b] f() a.b
a-- a++
Unary ++a --a
sizeof a sizeof(a)
&a *a ~a
!a +a -a
Cast (type) a
Multiplicative a*b a/b a%b
Additive a+b a-b
Shift a << b a >> b
Relational a<b a>b
a <= b a >= b
Equality a == b a != b
Bit AND a&b
Bit EOR a^b
Bit OR a|b
Logical AND a && b
Logical OR a || b
Conditional a?b:c
Assignment a=b
a += b a -= b
a *= b a /= b a %= b
a &= b a ^= b a |= b
a <<= b a >>= b
Comma a,b
Table 21: Rules of operator precedence
Operations higher up in the table have precedence over those lower. Those at
the same level execute in the order they appear. The optimizer often regroups
sub-expressions that are both associative and commutative in order to improve
the efficiency of generated code. The order of any side-effects, such as
185
C Precedence Rules
186
17. ASCII Chart
HEX ASCII HEX ASCII HEX ASCII HEX ASCII
00 NUL 20 SP 40 @ 60 `
01 SOH 21 ! 41 A 61 a
02 STX 22 “ 42 B 62 b
03 ETX 23 # 43 C 63 c
04 EOT 24 $ 44 D 64 d
05 ENQ 25 % 45 E 65 e
06 ACK 26 & 46 F 66 f
07 BEL 27 ‘ 47 G 67 g
08 BS 28 ( 48 H 68 h
09 HT 29 ) 49 I 69 i
0A LF 2A * 4A J 6A j
0B VT 2B + 4B K 6B k
0C FF 2C , 4C L 6C l
0D CR 2D - 4D M 6D m
0E SO 2E . 4E N 6E n
0F SI 2F / 4F O 6F o
10 DLE 30 0 50 P 70 p
11 DC1 31 1 51 Q 71 q
12 DC2 32 2 52 R 72 r
13 DC3 33 3 53 S 73 s
14 DC4 34 4 54 T 74 t
15 NAK 35 5 55 U 75 u
16 SYN 36 6 56 V 76 v
17 ETB 37 7 57 W 77 w
18 CAN 38 8 58 X 78 x
19 EM 39 9 59 Y 79 y
1A SUB 3A : 5A Z 7A z
1B ESC 3B ; 5B [ 7B {
1C FS 3C < 5C \ 7C |
1D GS 3D = 5D ] 7D }
1E RS 3E > 5E ^ 7E ~
1F US 3F ? 5F _ 7F DEL
Table 22: ASCII characters
187
18. Glossary
accumulator
Also AC, ACC. A register which holds the resulting values of ALU operations.
a/d
Analog to digital.
address
A number which indicates the storage location of data in memory.
addressing mode
The syntax used to describe a memory location to the CPU.
algorithm
A solution to a problem.
ALU
Arithmetic Logic Unit. Performs basic mathematical manipulations such as add, subtract,
complement, negate, AND, OR.
analog
A continuous range of voltage values.
AND
Logical operation where the result is 1 iff ANDed terms both have the value 1.
ANSI C
American National Standards Institute standards for C language.
array
A group of data elements indexed and stored in contiguous memory.
ASCII
The American Standard Code for Information Interchange is used to represent characters.
assembler
Program that converts a machine’s assembly language into object code.
assembly language
Mnemonic form of a specific machine language.
assignment
Store a value in a variable.
189
Glossary
asynchronous
Unclocked or not synchronous with CPU timing.
bank
A logical unit of memory (64k).
baud
The number of bits transmitted per second
binary
Base 2 number system which contains only the numbers 0 and 1.
bit
Binary digit which is either 0 or 1.
bit field
A group of contiguous bits considered as a unit.
block
Any section of C code enclosed by braces {}.
breakpoint
A set location to stop executing program code. Breakpoints are used in debugging
programs.
bus
Path for signals between components of a computer system .
byte
Eight bits.
C
High level programming language.
cast
Also Coerce. Convert a variable from one type to another.
checksum
A value which is the result of adding specific binary values. A checksum is often used to
verify the integrity of a sequence of binary numbers.
clear
Set a bit to 0.
clock
Fixed-frequency signal that triggers or synchronizes CPU operation and events. A clock has
a frequency which describes its rate of oscillation in MHz.
190
Glossary
comment
Non-executed text included in a program in order to explain what the executable
statements in the program are doing.
compiler
Program that converts a high level language to object code.
computer operating properly (COP)
control statement
Statement which controls the execution of other statements based on conditions provided
by the programmer.
CPU
Central Processing Unit. It fetches, decodes and executes instructions.
cross assembler
An assembler that runs on one type of computer and assembles the source code for a
different target computer. For example, an assembler that runs on a 80486 and generates
object code for Motorola’s 68HC05.
cross compiler
A compiler that runs on one type of computer and compiles source code for a different
target computer. For example, a compiler that runs on a 80486 and generates object code
for Motorola’s 68HC05.
crystal
A quartz crystal which provides a frequency for clock timing.
debugger
A program which helps with system debugging where program errors are found and
repaired. Debuggers support such features as breakpoints, dumping, memory modify.
decision statement
Statement which controls the program flow based on the result of testing a condition.
declaration
A specification of the type, name and possibly the value of a variable.
decoder
The unit which decodes bits into mutually exclusive outputs.
dereference
Also *. Access the value pointed to by a pointer.
directive
A command given to the preprocessor which begins with a #.
191
Glossary
EEPROM
Electrically erasable programmable read only memory.
embedded
Fixed within a surrounding system or unit.
escape character
The / character in C can be used as an escape character.
executable
A file which contains code which can be run on a specific target device.
fixed point
Integer representation where the decimal is in a fixed position.
floating point
The integer representation of decimal numbers using a mantissa field and an exponent field.
global variable
Variable that can be read or modified by any part of a program.
header file
Source code which is inserted into another source file using the #include preprocessor
directive.
hexadecimal
Also Hex. Base 16 numbering system which uses the digits 0-9 and the letters A-F.
include file
A file which is included by the preprocessor due to the use of the #include directive.
index register
Also X. Register used to hold an increment which can be added to an address when indirect
addressing is used.
integer
A number with no decimal, a whole number.
interrupt
A signal sent to the CPU to request service. The CPU saves its state and branches to a
routine to handle the interrupt. After the interrupt has been handled the saved state is
restored.
library
Collection of functions which are available for use by other programs.
192
Glossary
linker
A program which combines separate object files together in order to create an executable
file.
local variable
Variable that can only be used by a specific module or modules in a program.
logical operator
Operators which perform logical operations on their operands. For example, !, &&, ||.
machine language
Binary code instructions which can be understood by a specific CPU.
macro
Source code which is given a unique label. If the compiler sees the label in following source
code it will replace it with the body of the macro.
mask
A group of bits designed to set or clear specific locations in another grout of bits when
used with a logical operator.
maskable interrupt
Interrupts which software can activate and deactivate.
memory mapped
A virtual address or device is associated with an actual address in memory.
microcontroller
Also MCU. Single chip which controls another device and contains a CPU, memory and
I/O ability. A type of embedded controller.
microprocessor
Also µP. A single chip CPU.
module
A logically united part of a program which is in the same source code file.
nibble
A four bit binary number.
NOP
No operation. An instruction which is used to create a delay.
not
Logical negation. A 0 becomes a 1 and a 1 becomes a 0.
193
Glossary
object code
Machine language instructions represented by binary numbers not in executable form.
Object files are linked together to produce executable files.
octal
Base 8 number system.
operator
A symbol which represents an operation to be performed on operands. For example, +, *,
/.
or
A Boolean operation which yields 1 if any of its operands is a 1.
paging
A page is a logical block of memory. A paged memory system uses a page address and a
displacement address to refer to a specific memory location.
parameter
A variable used to pass information to and from a function.
pointer
An address of a specific object in memory which is used to refer to that object.
port
A physical I/O connection.
preprocessor
A program which prepares data for processing by the compiler.
program
Collection of instructions for a computer written in a programming language which
implement an algorithm.
program counter
Also PC. A register which holds the address of the next instruction to be executed. The
program counter is incremented after each instruction is fetched.
PROM
Programmable read-only memory. ROM that can be programmed.
RAM
Random Access Memory. RAM is read/write memory.
real number
A number which can have a decimal place.
194
Glossary
real time
A system which reacts at a speed commensurate with the time an actual event occurs.
recursive
A function which calls itself.
register
A byte or word of memory which can be directly accessed by the processor. Registers are
accessed more quickly than other memory locations. Some registers are CPU registers
which means that they exist within the CPU.
reset
To return to a selected beginning point.
return
An instruction which terminates a function.
ROM
Read Only Memory.
ROMable
Code which will execute when placed in ROM memory.
scope
A variable’s scope is the areas of a program in which it can be accessed.
sequencer
A module which provides the next program address to memory.
serial
Sequential transmission of one bit at a time using a single line.
set
Give a bit the value 1.
shift
Move the contents of a register to the left or right.
side-effect
An unintentional change to a variable.
simulator
A program which has the same input and output behaviour as a specific device. Timing
considerations can not be tested with a simulator.
source code
A program in assembly language or a high level language before it passes through an
assembler or compiler.
195
Glossary
stack
A section of RAM which is used to store temporary data. A stack is a last-in-first-out
(LIFO) structure which contains information which is saved and restored.
stack pointer
A register which contains the address of the top of the stack.
static
A variable that is stored in a reserved area of RAM instead of in the stack. The area
reserved cannot be used by other variables.
synchronous
Operations which are controlled by a clock pulse.
timer
UART
Universal asynchronous receiver/transmitter. A serial-to-parallel and parallel-to-serial
converter.
USART
Universal Synchronous/Asynchronous Receiver/Transmitter. A chip which handles
synchronous data communications.
variable
A symbolically named address or range of addresses which can be assigned values.
void
A C data type.
word
A 16 bit binary number.
196
19. Bibliography
Oualline, Steve. Practical C Programming. Sebastopol, CA: O’Reilly & Associates, 1991.
197
Index
20. Index
! =
! · See not operator = · See assignment operator
!= · See inequality operator == · See equality operator
# >
#define · 57 > · See greater-than operator
#include · 57 >= · See greater-than-or-equal
#pragma · 58
A
&
accumulator · 14
& operator algorithm · 83
precedence · 185 ALU · 13
&& · See and operator and operator · 93
arithmetic logic unit · See ALU
arithmetic operators · 88
* ASCII · 71, 76
assembler · 47
* operator assembly language · 46
precedence · 185 assignment operator · 87
assignment statement · 60
asynchronous · 16
{
{ · See braces
B
baud rate · 16
binary · 44
< binary notation · 79
binary operators · 85
< · See less-than operator bits · 45
<= · See less-than-or-equal block · 69
braces · 59, 69, 70, 102
bus · 6, 19
198
Index
operator · 88
C double data type · 81
double underscore · 1
central processing unit · See CPU
character · 71
character data type · 76
assigning · 76
E
clock · 11
collating sequence · 77 else statement · 101
comma operator · 87 matching with if · 102
comments · 56 emulator · 51
C++ · 56 equality operator · 92
compiler · 49, 50, 66, 69 equality operators · 91
cross compiler · 51 escape sequence · 77
constant · 67 expression
defining with #define · 57 evaluation · 84
control statement · 60 expressions · 84
control structure · 99 compared to statements · 84
CPU · 6, 19
cross compiler · See compiler
F
D floating point numbers · 80
function
data abstraction · 75, 113 body · 60
data type · 71 header · 60
character · 76 identifier · 68
double · See double data type prototype · 59
float · 81 functions · 58, 65
function · 75
integer · See integer data type
long · See long data type G
long double · See long double data type
modifiers · See modifiers GIE · 18
parameter · 76 global interrrupt enable · See GIE
short · See short data type greater-than operator · 93
dead code · 94 greater-than-or-equal · 93
decimal notation · 79
decoder · 13
decrement H
operator · 89
postfix · 89
Harvard architecture · 7
prefix · 90
header file · 57, 81
development platform · 5
hexadecimal · 46
directives · See preprocessor directives
hexadecimal notation · 79
division
integer · 88
199
Index
I M
identifer · See also variable machine code · 73
identifier machine language · 46, 50
and significant characters · 67 main() · 58
constant · 67 maskable interrupt · See interrupt
memory allocation · 66 memory
naming rules · 66 allocation for variables · 73
identifiers · 65 microcontroller · 5
if · 61 standard · 2
if statement · 100 microprocessor · 5
matching with else · 102 modifiers · 80
increment · 83 module · 57
operator · 89 modulus
postfix · 89 operator · 88
prefix · 90
index register · 14
inequality operator · 92 N
initialization code · 111
integer data type · 71, 78 nesting
assigning to a float · 81 if statements · 101
integer variables · 80 not operator · 93
interrupt · 6, 18
maskable · 18
non-maskable · 18
intger data type O
sign bit · 78
octal notation · 79
operator
K binding · 85
precedence · 86
operator precedence · 185
keywords · 66 operators · 83
arithmetic · 88
binary · 85
L postfix · 85
prefix · 85
LED · 55 trinary · 86
less-than operator · 93 unary · 85
less-than-or-equal · 93 or operator · 93
linker · 50
local variables · See variables, local
logical operators · 91 P
long data type · 72, 79
long double data type · 81 parameter · 59
loop · 61, 92 parameters · 76
infinate · 61
200
Index
201
Index
202
CATALOG 05/2001
MPC
! Supports all Microchip PIC 12x/14x/16x/17x ! Supports all 68HC08 variants
C6808 The compilers generate tight, fast, and efficient
executables, as well as listing files that match the original C
source to the code generated. Several optional reports
families, 8K and Flash parts ! Supports LOCAL memory reuse, SPECIAL memory (symbol information, nesting level, register contents) can
! Named address space supports variable grouping through software appear in the listing file.
! Works with Microchip's PICMASTER, ICE 2000 emulator, ! Supports 6808 extended addressing, instructions
Header files describe each processor derivative.
MPLAB-SIM simulator, Advanced Transdata, Tech-Tools ! Support for symbolic debugging with many emulators #pragma statements configure the compiler for available
Mathias, Clearview, iSystem including Motorola MMDS08 and MMEVS08, and the interrupts, memory resources, ports, and configuration
! Supports setting configuration fuses through C Ashling CT68HC08 registers. Convenient #defines make your programs
! Demo at www.bytecraft.com/impc.html ! Supports setting Mask Option Register through C portable between members of a processor family.
! Demo at www.bytecraft.com/i08.html
C extensions include: bit and bits data types, binary
constants, case statement extensions, direct register access
for DOS
or Windows COP8C
! Supports the Feature Family, and SGR/SGE
for DOS
or Windows SXC
! Supports all SX variants, including SX48 and SX52
in C, embedded assembly, initialization control, direct
variable placement, interrupt support in C.
! Supports LOCAL memory reuse, SPECIAL memory ! Supports LOCAL memory reuse, SPECIAL memory Two forms of linking are available: Absolute Code Mode
through software through software links library modules into the executable during
! Supports SREG memory management ! Supports virtual device drivers within C compilation. The BClink linker uses a more traditional
! Data types include bit, bits, char, short, int, linker command file and object files. Either route provides
! Support for symbolic debugging with emulators
optimization at final code generation.
including MetaLink int8/16/24/32, long, float and fixed point
! Supports setting configuration fuses through C ! Support for assembly source-level debugging with You can include Macro Assembler instructions within C
! Demo at www.bytecraft.com/icop.html Parallax SX-Key code, or as separate source files. Embedded assembly code
! Demo at www.bytecraft.com/isxc.html can call C functions and access C variables directly. You
can also pass arguments to and from assembly code.
Availability
C6805
! Supports all 68HC05 variants ! Supports all Zilog Z8 and Z8+ variants
Z8C Byte Craft Limited products are available world-wide, both
directly from Byte Craft Limited and through our
! Supports LOCAL memory reuse, SPECIAL memory ! Supports instruction set variants C94, C95, HALT, MUL, distributors. Demonstration versions of the Code
through software STOP, WAIT Development System are available.
! Support for symbolic debugging with many emulators ! Supports processor-specific instructions DI, EI, HALT,
NOP, RCF, SCF, STOP, WAIT, WDT, WDH For more information, see www.bytecraft.com.
including MMDS05, MMEVS, and Metalink iceMASTER
! E6805 available to support Motorola EVM, EVS ! Generates information required for source-level Upgrade Policy
! Supports setting Mask Option Register through C debugging Registered customers receive free upgrades and
! Demo at www.bytecraft.com/i05.html ! Demo at www.bytecraft.com/iz8c.html technical support for the first year. All other registered
users may purchase major releases for a fraction of the
full cost. Along with our version upgrades, Byte Craft
Limited remains committed to maintaining a high level of
C38
! Supports all MELPS740 variants, including 7600 series,
Fuzz-C™
! Transforms fuzzy logic to plain C; call between C and
technical support.
www.bytecraft.com