GCC QB

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 16

CS6703 GRID & CLOUD COMPUTING

UNIT I INTRODUCTION
PART-A
1. What are the computing Paradigm Distinctions?
Centralized computing
Parallel Computing
Distributed Computing
Cloud Computing
2. What is meant by Centralized Computing?
This is a computing paradigm by which all computer resources are centralized in one
physical system.
All resources (processors, memory, and storage) are fully shared and tightly coupled
within one integrated OS.
3. What is meant by Parallel Computing?
In parallel computing, all processors are either tightly coupled with centralized shared
memory or loosely coupled with distributed memory.
Interprocessor communication is accomplished through shared memory or via message
passing.
A computer system capable of parallel computing is commonly known as a parallel
computer.
Programs running in a parallel computer are called parallel programs. The process of
writing parallel programs is often referred to as parallel programming.
4. What is meant by Distributed computing?
A distributed system is a network of autonomous computers that communicate with each
other in order to achieve a goal.
The computers in a distributed system are independent and do not physically share
memory or processors. They communicate with each other using messages, pieces of
information transferred from one computer to another over a network.
5. What is meant by Cloud Computing?
An Internet cloud of resources can be either a centralized or a distributed computing
system.
The cloud applies parallel or distributed computing, or both. Clouds can be built with
physical or virtualized resources over large data centers that are centralized or distributed.
6. What are the various degrees of parallelism?
Bit level parallelism (BLP)
Instruction Level Parallelism (ILP)
Data Level Parallelism (DLP)
Task Level Parallelism (TLP)
Job Level Parallelism (JLP)
7. What are the applications of High Performance and High-Throughput Systems?

8. What is meant by High Throughput Computing (HTC)?


High-throughput computing (HTC) describes the use of many computing resources over
long periods of time to accomplish a computational task. By using distributed computing
enables lots of jobs to be scheduled to available resources to complete as fast as possible.
9. What is meant by High Performance Computing (HPC)?
High-performance computing (HPC) is the use of super computers and parallel processing
techniques for solving complex computational problems. HPC technology focuses on
developing parallel processing algorithms and systems by incorporating both administration
and parallel computational techniques.
10. What is Internet of Things (IoT) ?
IoT refers to the networked interconnection of everyday objects, tools, devices or computers.
Each object is assigned an IP an address(IPv6) to distinguish from one another.
11. What are the 3Cs of Cyber Physical System?
CPS integrates Cyber (heterogenous, asynchronous) with physical(concurrent and
information dense) objects. A CPS merges the 3C technologies of Computation,
Communication and Control.
12. What is meant by Multicore processor ?
A multi-core processor is an integrated circuit (IC) to which two or more processors have
been attached for enhanced performance, reduced power consumption, and more efficient
simultaneous processing of multiple tasks.
13. What is Graphic Processing Unit (GPU) ?
A Graphics Processing Unit (GPU) is a single-chip processor primarily used to manage and
boost the performance of video and graphics. GPU features include
2-D or 3-D graphics
Digital output to flat panel display monitors
Texture mapping
Application support for high-intensity graphics , etc
These features are designed to lessen the work of the CPU and produce faster video and
graphics.
A GPU is not only used in a PC on a video card or motherboard; it is also used in mobile
phones, display adapters, workstations and game consoles.
14. What are the different forms of Multithreaded Processor?

4-issue superscalar processor


Fine grained multithreaded Processor
Coarse grained multithreaded Processor
Simultaneous Multithreaded (SMT) Processor

15. What is the difference between Storage area network (SAN) and Network attached
Storage (NAS)?
A SAN connects servers to network storage such as disk arrays where as NAS connects
clients hosts directly to the disk arrays.
16. What is a hypervisor?
A hypervisor, also called a virtual machine manager, is a program that allows multiple
operating systems to share a single hardware host. Each operating system appears to have the
host's processor, memory, and other resources all to itself.
17. What are the types of hypervisor?
Type 1 hypervisors run directly on the system hardware. They are often referred to as a
"native" or "bare metal" or "embedded" hypervisors in vendor literature.
Type 2 hypervisors run on a host operating system.

18. What is a Virtual Machine (VM)?


A virtual machine (VM) is a software program or operating system that not only exhibits the
behavior of a separate computer, but is also capable of performing tasks such as running
applications and programs like a separate computer. A virtual machine, usually known as a
guest is created within another computing environment referred as a "host." Multiple virtual
machines can exist within a single host at one time. A virtual machine is also known as a
guest.

19. What is a grid system?

Interconnected computer systems where the machines utilize the same resources collectively.
Grid computing usually consists of one main computer that distributes information and tasks
to a group of networked computers to accomplish a common goal. Grid computing is often
used to complete complicated or tedious mathematical or scientific calculations.

20. How is grid Systems classified?


Grid Systems are classified into two categories. They are Computational or data grids and
P2P grids.
Data grid: A data grid is a set of structured services that provides multiple services like the
ability to access, alter and transfer very large amounts of geographically separated data,
especially for research and collaboration purposes. Data from different regions are pulled
from administrative domains which filter data for security purposes, and present it to the user
upon request by means of a middleware application.
P2P Systems: Every node acts as both client and a server, providing part of the system
resources. Peer machines are simply client computers connected to the Internet. All client
machines act autonomously to join or leave the system freely.

21. List out P2P Application Families ?

22. What are the differences between Grid computing and cloud computing?
Grid computing
Cloud computing
What?
Grids enable access to shared Clouds enable access to
computing power and storage capacity leased computing power and
from your desktop
storage capacity from your
Who provides the Research institutes and universities Large
desktop individual companies
service?
federate their services around the e.g. Amazon and Microsoft.
world.

Who
uses
service?

the Research
collaborations,
called
"Virtual Organizations", which bring
together researchers around the world
working in the same field.
Who pays for the Governments - providers and users are
service?
usually publicly funded research
organizations.

Small to medium commercial


businesses or researchers with
generic IT needs
The cloud provider pays for
the computing resources; the
user pays to use them

23. What is Service Oriented Architecture (SOA)?


A service-oriented architecture is essentially a collection of services. These services
communicate with each other. The communication can involve either simple data passing or
it could involve two or more services coordinating some activity.
24. List out the access model for organizing a data grid?
Monadic Model
Hierarchical Model
Federation Model
Hybrid Model
25. List out the Resources to perform grid computing?
Compute Resources

Storage Resources

Network Resources

Code Repositories
Service Catalogs

26. State High Performance Computing (HPC)?

HPC: use of parallel processing to execute large programs quickly;


often equated to supercomputers, typically applied to systems generating teraflops
(1012) or more. Emphasis: raw speed performance and accuracy

27 State High Throughput Computing (HTC)?

HTC: running a job that takes days to complete, or an application that must
produce a high number of completed operations per unit of time. Performance
measured in flops per month or year, as opposed to per second.eg high-flux
computing is in Internet searches and web services, Emphasis: batch processing
speed, cost, energy savings, security, and reliability.

28. Define Moores law?

Moores law indicates that processor speed doubles every 18 months.

29. What are computational grids?

A computational grid is hardware and software infrastructure that provides


dependable, consistent, pervasive and inexpensive access to high-end computational
capabilities.

30. What is meant by grid infrastructure?

Grid infrastructure is a complex combination of a number of capabilities


and resources identified for the specific problem and environment being addressed. It
forms the core foundations for successful grid applications.

31. What are the derivatives of grid computing?


There are 8 derivatives of grid computing. They are as follows:

a) Compute grid

b) Data grid

c) Science grid

d) Access grid

e) Knowledge grid

f) Cluster grid

g) Terra grid

h) Commodity grid

32. What are the grid computing applications?

Application partitioning that involves breaking the problem into discrete


pieces. Discovery and scheduling of tasks and workflow. Data communications
distributing the problem data where and when it is required.

33. Define Cluster computing:

Cluster is local to the domain and constructed to solve inadequate


computing power. It is related to the pooling of computational resources to provide more
computing power by parallel execution of the workload.

34. Define SOA.

Service oriented architecture is intended to define loosely coupled and


interoperable services applications, and to define process for integrating these
interoperable components. In SOA, the system is decomposed into a collection of
network-connected components. Applications and resources within a SOA shouldnt
be built as a tightly coupled monolithic model. Rather; these applications are
composed dynamically from the deployed and available services in the network.

35. What are the business areas needs in Grid computing?

Life Sciences

Financial services

Higher Education

Engineering Services

Government

Collaborative games

36. Write the any three Grid Applications.

Schedulers

Resource Broker

Load Balancing

37. What are the two most important technologies for building semantic webs?

XML

Resource Description Framework(RDF)

38. What is meant by Virtual Machines?

Virtual machines (VMs) offer novel solutions to underutilized resources,


application inflexibility, and software manageability, and security concerns in
existing physical machines.

39. What are the business benefits in Grid Computing?

Acceleration of implementation time frames in order to intersect with the


anticipated business end results.

Improved productivity and collaboration of virtual organizations and respective


computing and data resources.

Allowing widely dispersed departments and business to create virtual organizations


to share data and resources.

40. What are the areas are difficult to implement in Grid Computing Infrastructure?

A Grid computing infrastructure component must address several potentially


complicated areas in many stages of the implementation. These areas are
Security

Resource management

Information services

Data management

41. Give the different layers of grid architecture.

Fabric Layer: Interface to local resources

Connectivity Layer: Manages Communications

Collective Layer: Coordinating Multiple Resources

Application Layer: User-Defined Application.

42.What do you mean by Grid Services?

Grid services are stateful Web services. The service itself maintains some
state information and it exposes a set of standard interfaces to enable interactions with its
client.

43. Define Hype Cycle

This cycle shows the expectations for the technology at five different

stages. The expectations rise sharply from the trigger period to a high peak of
inflated expectations.Through a short period of disillusionment, the expectation may
drop to a valley and then increase steadily over a long enlightenment period to a
plateau of productivity. The number of years for an emerging technology to reach a
certain stage is marked by special symbols.

44. Difference between CPU and GPU ?

A Graphics Processing Unit (GPU) is a special purpose processor, optimized


or calculations commonly (and repeatedly) required for Computer Graphics,
particularly
SIMD
operations.
A Central Processing Unit (CPU) is a general purpose processor - it can in
principle do any computation, but not necessarily in an optimal fashion for any given
computation. One can do graphics processing on a CPU - but it likely will not
produce the result anywhere nearly as fast as a properly programmed GPU.

45.what is meant by NAS and SAN ?

A storage area network (SAN) connects servers to network storage such as


disk arrays. Network attached storage (NAS) connects client hosts directly to the
disk arrays.

PART B

1.
Explain all the evolutionary changes in the Age of Internet Computing.
The Age of Internet Computing
High-performance computing (HPC)
High-throughput computing (HTC)
The Platform Evolution
Computer technology has gone through five generations of development, with each
generation lasting from 10 to 20 years
High-Performance Computing
The speed of HPC systems has increased from Gflops in the early 1990s to now
Pflops in 2010.
High-Throughput Computing
This HTC paradigm pays more attention to high-flux computing. The main
application for high-flux computing is in Internet searches and web services by
millions or more users simultaneously.
Three New Computing Paradigms
The maturity of radio-frequency identification (RFID), Global Positioning System
(GPS), and sensor technologies has triggered the development of the
Internet of Things (IoT).
Computing Paradigm DistinctionsThe high-technology community has
argued for many years about the precise definitions of centralized computing,
parallel computing, distributed computing, and cloud computing.
Centralized computing this is a computing paradigm by which all
computer resources are centralized in one physical system. All resources (processors,
memory, and storage) are fully shared and tightly coupled within one integrated OS.
Many data centers and supercomputers are centralized systems, but they are used in
parallel, distributed, and cloud computing applications
Parallel computing in parallel computing, all processors are either
tightly coupled with centralized shared memory or loosely coupled with distributed
memory. Some authors refer to this discipline as parallel processing. A computer
system capable of parallel computing is commonly known as a parallel computer.
Programs running in a parallel computer are called parallel programs. The process
of writing parallel programs is often referred to as parallel programming
Distributed computing a distributed system consists of multiple
autonomous computers, each having its own private memory, communicating
through a computer network. Information exchange in a distributed system is
accomplished through message passing. A computer program that runs in a
distributed system is known as a distributed program. The process of writing
distributed programs is referred to as distributed programming.
Cloud computing An Internet cloud of resources can be either a
centralized or a distributed computing system. Or utility computingor service
computing Ubiquitous computing refers to computing with pervasive devices at any

place and time using wired or wireless communication. The Internet of


Things (IoT) is a networked connection of everyday objects including computers,
sensors, humans, etc.
Distributed System Families
The system efficiency is decided by speed, programming, and energy factors
Meeting these goals requires to yield the following design objectives:
Efficiency measures the utilization rate of resources in an execution
model by exploiting massive parallelism in HPC.
Dependability measures the reliability and self-management from
the chip to the system and application levels.
Adaptation in the programming model measures the ability to
support billions of job requests over massive data sets and virtualized cloud
resources under various workload and service models.
Flexibility in application deployment measures the ability of
distributed systems to run well in both HPC and HTC applications.
2. Explain about Scalable Computing Trends and its New Paradigms. What do
you mean by the Internet of Things and Cyber-Physical Systems? Discuss.
Scalable Computing Trends and New Paradigms
Moores law indicates that processor speed doubles every 18 months.
Gilders law indicates that network bandwidth has doubled each year in the past.
Degrees of Parallelism
bit-level parallelism (BLP) converts bit-serial processing to word-level processing
gradually This led us to the next wave known as instruction-level parallelism
(ILP)Data-level parallelism (DLP) was made popular throughSIMD (single
instruction, multiple data) and vector machines using vector or array types of
instructions. From chip multiprocessors (CMPs), we have been exploring task-level
parallelism (TLP).
Innovative Applications
Both HPC and HTC systems desire transparency in many application aspects
Applications of High-Performance and High-Throughput Systems
The Trend toward Utility Computing
These paradigms are composable with QoS and SLAs (service-level agreements).
The Hype Cycle of New Technologies
This cycle shows the expectations for the technology at five different stages. The
expectations rise sharply from the trigger period to a high peak of inflated
expectations.
The Internet of Things and Cyber-Physical Systems
The Internet of Things
Three communication patterns co-exist: namely H2H (human-to-human), H2T
(human-to-thing), and T2T (thing-to-thing
Cyber-Physical Systems

A cyber-physical system (CPS) is the result of interaction between computational


processes and the physical world.
3. Write in detail about Clusters of Cooperative Computers.
Clusters of Cooperative Computers
Cluster Architecture
The cluster is connected to the Internet via a virtual private network (VPN) gateway.
The gateway IP address locates the cluster. Most clusters have loosely coupled node
Computers. All resources of a server node are managed by their own OS.
Single-System Image
Cluster designers desire a cluster operating system or some middleware to support
SSI at various levels, including the sharing of CPUs, memory, and I/O across all
cluster nodes.
Hardware, Software, and Middleware Support
Special cluster middleware supports are needed to create SSI or high availability
(HA).
Both sequential and parallel applications can run on the cluster, and special parallel
environments are needed to facilitate use of the cluster resources.
Major Cluster Design Issues
Unfortunately, a cluster-wide OS for complete resource sharing is not available yet.
Critical Cluster Design Issues and Feasible Implementations
Features Functional Characterization Feasible Implementations
4. Explain in detail about Grid Computing Infrastructures. Discuss about
Cloud Computing over the Internet.
Grid Computing Infrastructures
Computational Grids
A computing
grid offers
an
infrastructure
that
couples
computers,
software/middleware, special instruments, and people and sensors together. The grid
is often constructed across LAN, WAN, or Internet backbone networks at a regional,
national, or global scale. They can also be viewed asvirtual platforms to
support virtual organizations.
Grid Families
Grid technology demands new distributed computing models, software/middleware
support, network protocols, and hardware infrastructures. National grid projects are
followed by industrial grid platform development by IBM, Microsoft, Sun, HP, Dell,
Cisco, EMC, Platform Computing, and others.
Cloud Computing Over the Internet

A cloud is a pool of virtualized computer resources. A cloud can host a variety of


different workloads, including batch-style backend jobs and interactive and userfacing applications.
Internet Clouds
Cloud computing leverages its low cost and simplicity to benefit both users and
providers.
The Cloud Landscape
Infrastructure as a Service (IaaS) this model puts together infrastructures
demanded by usersnamely servers, storage, networks, and the data center fabric.
The user can deploy and run on multiple VMs running guest OSes on specific
applications. The user does not manage or control the underlying cloud
infrastructure, but can specify
When to request and release the needed resources.
Platform as a Service (PaaS) This model enables the user to deploy user-built
applications onto a virtualized cloud platform. PaaS includes middleware, databases,
development tools, and some runtime support such as Web 2.0 and Java. The
platform
Includes both hardware and software integrated with specific programming
interfaces.
Software as a Service (SaaS) this refers to browser-initiated application software
over thousands of paid cloud customers. The SaaS model applies to business
processes,
Industry applications, consumer relationship management (CRM), enterprise
resources
Planning (ERP), human resources (HR), and collaborative applications.
Internet clouds offer four deployment modes: private, public, managed,
andhybrid these modes demand different levels of security implications. The
following list highlights eight reasons to adapt the cloud for upgraded Internet
applications and web services:
1. Desired location in areas with protected space and higher energy efficiency
2. Sharing of peak-load capacity among a large pool of users, improving overall
utilization
3. Separation of infrastructure maintenance duties from domain-specific application
development
4. Significant reduction in cloud computing cost, compared with traditional
computing
Paradigms
5. Cloud computing programming and application development
6. Service and data discovery and content/service distribution
7. Privacy, security, copyright, and reliability issues
8. Service agreements, business models, and pricing policies
5. Illustrate Service-Oriented Architecture (SOA).

Service-Oriented Architecture (SOA)


In grids/web services, Java, and CORBA, an entity is, respectively, a service, a Java
object, and a CORBA distributed object in a variety of languages. These
architectures build on the traditional seven Open Systems Interconnection (OSI)
layers that provide the base networking abstractions.
Layered Architecture for Web Services and Grids
The entity interfaces correspond to the Web Services Description Language(WSDL),
Java method, and CORBA interface definition language (IDL) specifications. These
interfaces are linked with customized, high-level communication systems: SOAP,
RMI, and IIOP. These communication systems are built on message-oriented
middleware infrastructure such as Web Sphere MQ or Java Message Service (JMS)
which provide rich functionality and support virtualization of routing, senders, and
recipients. In the case of fault tolerance, the features in the Web Services Reliable
Messaging (WSRM) framework mimic the OSI layer capability modified to match
the different abstractions
At the entity levels. Security is a critical capability that either uses or re implements
the capabilities seen in concepts such as Internet Protocol Security (IPsec) and
secure sockets in the OSI layers. The CORBA Trading Service, UDDI (Universal
Description, Discovery, and Integration), LDAP (Lightweight Directory Access
Protocol), and ebXML (Electronic Business using eXtensible Markup Language)
Web Services and Tools
Both web services and REST systems have very distinct approaches to building
reliable
Interoperable systems. This specification is carried with communicated messages
using
Simple Object Access Protocol (SOAP). The hosting environment then becomes a
universal distributed operating system with fully distributed capability carried by
SOAP
Messages. REST can use XML schemas but not those that are part of SOAP; XML
over HTTP is a popular design choice in this regard.
The Evolution of SOA
Filter services (fs in the figure) are used to eliminate unwanted raw data, in order to
respond to specific requests from the web, the grid, or web services.
Grids versus Clouds
THE general approach used in workflow, the BPEL Web Service standard, and
several important workflow approaches including Pegasus, Taverna, Kepler, Trident,
and Swift. May end up building with a system of systems: such as a cloud of clouds,
a grid of clouds, or a cloud of grids, or inter-clouds as a basic SOA architecture.
6. What are the Grid Standards?
Standards bodies that are involved in areas related to grid computing include:
Global Grid Forum (GGF)

Organization for the Advancement of Structured Information Standards (OASIS)


World Wide Web Consortium (W3C)
Distributed Management Task Force (DMTF)
Web Services Interoperability Organization (WS-I)

OGSA
The Global Grid Forum has published the Open Grid Service
Architecture (OGSA). OGSA defines requirements for these core capabilities and
thus provides a general reference architecture for grid computing environments. It
identifies the components and functions that are useful if not required for a grid
environment.
OGSI
The Global Grid Forum extended the concepts defined in OGSA to
define specific interfaces to various services that would implement the functions
defined by OGSA. A Grid service is a Web service that conforms to a set of
interfaces and behaviours that define how a client interacts with a Grid service.OGSI
provides the Web Service Definition Language (WSDL) definitions for these key
interfaces.
OGSA-DAI
The OGSA-DAI (data access and integration) project is concerned
with
Constructing middleware to assist with access and integration of data from separate
data sources via the grid.

GridFTP
GridFTP is a secure and reliable data transfer protocol providing high
performance and optimized for wide-area networks that have high bandwidth.
GridFTP uses basic Grid security on both control (command) and data channels.
Features include multiple data channels for parallel transfers, partial file transfers,
third-party transfers, and more.
GridFTP can be used to move files (especially large files) across a network
efficiently and reliably.

WSRF

WSRF defines a set of specifications for defining the relationship


between Web services and stateful resources. WSRF is a general term that
encompasses several related proposed standards that cover:
Resources
Resource lifetime
Resource properties
Service groups (collections of resources)
Faults
Notifications
Topics

XML
WSDL
SOAP
UDDI

Web services related standards


Standards commonly associate with Web services are

You might also like