Distributed Systems Characterization and Design
Distributed Systems Characterization and Design
Distributed Systems Characterization and Design
OUTLINE
1. What is a Distributed System 2. Examples of Distributed Systems 3. Common Characteristics 4. Basic Design Issues 5. Summary
Control
Fully Distributed
Autonomous fully cooperative Autonomous transaction based Master-slave Homog. Homog. general special purpose purpose Heterog. Heterog. special general purpose purpose
Local data, local directory Not fully replicated master directory Fully replicated
Processors
3
Definition: A distributed system is one in which components located at networked computers communicate and coordinate their actions only by passing messages. This definition leads to the following characteristics of distributed systems: Concurrency of components Lack of a global clock Independent failures of components
One component with non-autonomous parts Component shared by users all the time All resources accessible
Desktop computers
Web serv er
em ail s erv er File serv er print other s erv ers the res t of the Internet router/firewall
8
10
2.4 INTERNET
intranet % % ISP % %
backbone
2.4.1 WORLD-WIDE-WEB
12
Browsers
http://www.uu.se/
www.w3c.org File system of www.w3c.org http://www.w3c.org/Protocols/Activity.html Protocols
Activity.html
13
Host intranet
Wireless LAN
GSM/GPRS gateway
Home intranet
Printer Camera
Host site
14
3. COMMON CHARACTERISTICS
What are we trying to achieve when we construct a distributed system? Certain common characteristics can be used to assess distributed systems Heterogeneity Openness Security Scalability Failure Handling Concurrency Transparency
15
3.1 HETEROGENEITY
Variety and differences in Networks Computer hardware Operating systems Programming languages Implementations by different developers Middleware as software layers to provide a programming abstraction as well as masking the heterogeneity of the underlying networks, hardware, OS, and programming languages (e.g., CORBA). Mobile Code to refer to code that can be sent from one computer to another and run at the destination (e.g., Java applets and Java virtual machine).
16
3.2 OPENNESS
Openness is concerned with extensions and improvements of distributed systems. Detailed interfaces of components need to be published. New components have to be integrated with existing components. Differences in data representation of interface types on different processors (of different vendors) have to be resolved.
17
3.3 SECURITY
In a distributed system, clients send requests to access data managed by servers, resources in the networks:
Doctors requesting records from hospitals Users purchase products through electronic commerce Concealing the contents of messages: security and privacy Identifying a remote user or other agent correctly (authentication) Denial of service attack Security of mobile code
New challenges:
18
3.4 SCALABILITY
19
Hardware, software and networks fail! Distributed systems must maintain availability even at low levels of hardware/software/network reliability. Fault tolerance is achieved by
recovery redundancy
20
3.6 CONCURRENCY
Components in distributed systems are executed in concurrent processes. Components access and update shared resources (e.g. variables, databases, device drivers). Integrity of the system may be violated if concurrent updates are not coordinated.
21
3.7 TRANSPARENCY
Distributed systems should be perceived by users and application programmers as a whole rather than as a collection of cooperating components. Transparency has different aspects.
22
software engineering principles include rigor and formality, separation of concerns, modularity, abstraction, anticipation of change, Specific issues for distributed systems:
Naming Communication Software structure System architecture Workload allocation Consistency maintenance
31
4.1 NAMING
A name is resolved when translated into an interpretable form for resource/object reference.
Communication identifier (IP address + port number) Name resolution involves several translation steps
Design considerations
Choice of name space for each resource type Name service to resolve resource names to comm. id.
Name services include naming context resolution, hierarchical structure, resource protection
32
4.2 COMMUNICATION
Separated components communicate with sending processes and receiving processes for data transfer and synchronization. Message passing: send and receive primitives
Communication patterns: client-server communication (e.g., RPC, function shipping) and group multicast
33
Applications
Middleware
Operating system
34
36
Client
Serv er
Serv er
37
38
Serv er Client
Serv er
Client Serv er
39
Web serv er
Client
Web serv er
40
Web serv er
Client
Applet
Web serv er
41
Network computer or PC
Thin Client
network
Application Process
42
5. SUMMARY
Definitions of distributed systems and comparisons to centralized systems. The characteristics of distributed systems. The eight forms of transparency. The basic design issues. Read Chapter 1 and Chapter 2 of the textbook.
43