Databases Management System

Download as ppt, pdf, or txt
Download as ppt, pdf, or txt
You are on page 1of 40

DATABASES

MANAGEMENT SYSTEM

Lecture 1

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe


Course Information

 Course Organisation
 Classroom lectures and exercices
 Text book
 Elmasri and Navathe. Fundamentals of Database
Systems. Addison-Wesley, 5th edition, 2007.
 Slides will be made available

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe


Chapter 1
Introduction: Fundamental Concepts
of a Database System

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe


Introduction to Database Systems
 What Is a Database?
 can be viewed as a “repository for data” or “a collection of
data.”
 Implicit properties:
 It represents aspects of a real world.
 It is collection of coherent (related) data.
 It is designed, built and populated to address a specific situation in real
world.
 Database Management System (DBMS) ?
 is then a tool for creating and managing this large amounts of data
efficiently and allowing it to persist for a long periods of time.
 Hence, DBMS is a general-purpose software that facilities the processes of
 defining, constructing, manipulating, and sharing database.

4
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Cont’d
- Defining: involves specifying data types, structure and
constraints.
- Constructing: is the process of storing the data into a
storage media.
- Manipulating: is retrieving and updating data from
and into the storage.
- Sharing: allows multiple users to access data.
 The phrase “Database System” is used to colloquially
refer to database and database management system
(DBMS).
5
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Evolution of a Database System
 1st generation was file system, such as ISAM and VSAM.
 2nd generation was hierarchical database systems, such as
IMS(Information Management System) and System 2000.
 3rd generation was the network model CODASYL
(Conference on Data Systems Languages) database systems,
such as IDS(Integrated Data Store), TOTAL, ADABAS, IDMS,
etc.
 4th generation relational database technology.
 5th generation database technology will be characterized by a
richer data model and a richer set of database facilities.

6
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Database System Requirements

 Databases evolved to take responsibility for the data


away from the application, and most importantly to
enable data to be shared.
 Hence a database system must provide:
 Consistency
 Concurrency
 Performance
 Standard Adherence
 Security
 Reliability

7
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Cont’d
- Consistency: It must ensure that the data itself is not only
consistently stored but can be retrieved and shared efficiently.
- Concurrency: It must enable multiple users and systems to all
retrieve the data at the same time and to do so logically and
consistently.
- Performance: It must support reasonable response times.
- Standard adherence: It should support a standard language for
common understanding. Standard Query Language (SQL) has
to be supported.

8
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Cont’d
- Security: It should provide away to set access permissions
(much like files at the operating system level) and specific
database mechanisms such as triggers.
- Reliability: It must keep the stored data intact. Additionally, it
must cope well when things go awry and it must, if set up
properly, be able to recover to a known consistent point.

9
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Database System versus File
System(Traditional Approach)
 In the early days, database applications were built directly on top of file systems. The
traditional file processing system is file-directory structure supported by a
conventional operating system.
 Drawbacks of using file systems to store data:
 Data redundancy and inconsistency
 Multiple file formats, duplication of information in different files
 Difficulty in accessing data
 Need to write a new program to carry out each new task
 Data isolation — multiple files and formats
 Integrity problems
 Integrity constraints (e.g. account balance > 0) become “buried” in program
code rather than being stated explicitly
 Hard to add new constraints or change existing ones

10
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Database System versus File
System(Traditional Approach)
 Drawbacks of using file systems (cont.)
 Atomicity of updates
 Failures may leave database in an inconsistent state with partial updates
carried out
 Example: Transfer of funds from one account to another should either
complete or not happen at all
 Concurrent access by multiple users
 Concurrent accessed needed for performance
 Uncontrolled concurrent accesses can lead to inconsistencies
– Example: Two people reading a balance and updating it at the same time
 Security problems
 Hard to provide user access to some, but not all, data.
 Database systems offer solutions to all the above problems

11
Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe
Why Databases?
 Scientists (biologists) have to manage huge quantities of
data
 Results of experiments
 References to relevant publications
 DNA sequences
 …
 Organizations have to manage huge quantities of data
 Student related data
 Financial Data,
 Patient data ,
 Weather Data,
 Customer data,
 …………….

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 12


Why Databases?
 Those data need to be stored in a consistent way, shared
and analysed
 Which are the experiments on a cellular biology done at my
lab in 2009?
 Which are the publications of my group?
 Which are the genes in the X chromosome?
 Databases are a possible solution to this issue

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 13


Database vs ad hoc programming
 Data can be managed and stored using ad-hoc programs
(e.g., in Java, c++, …)
 Need for writing ad hoc algorithms (e.g. search, sorting)
 Programs have to be changed when data change
 Need for concurrency control, backups, …
 Databases used to not re-solve the same problems every
time
 Standard (good) solutions to the most common problems
 Easy to use and to configure
 Standard programs used for specific tasks (e.g., scientific
computations) may rely on a database for data
management

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 14


Types of Databases and Database
Applications
 Traditional Applications:
 Numeric and Textual Databases
 More Recent Applications:
 Multimedia Databases
 Geographic Information Systems (GIS)
 Data Warehouses
 Real-time and Active Databases
 Many other applications
 We will focus on traditional applications, with
emphasis on scientific (biological) databases

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 15


Basic Definitions
 Database:
 A collection of related data.

 Data:
 Known facts that can be recorded and have an implicit meaning.

 Mini-world:
 Some part of the real world about which data is stored in a

database. For example, student grades and transcripts at a


university.
 Database Management System (DBMS):
 A software package/ system to facilitate the creation and

maintenance of a computerized database.


 Database System:
 The DBMS software together with the data itself. Sometimes, the

applications are also included.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 16


Simplified database system environment

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 17


Typical DBMS Functionality
 Define a particular database in terms of its data types,
structures, and constraints
 Construct or Load the initial database contents on a
secondary storage medium
 Manipulating the database:
 Retrieval: Querying, generating reports
 Modification: Insertions, deletions and updates to its content
 Accessing the database through Web applications
 Processing and Sharing by a set of concurrent users and
application programs – yet, keeping all data valid and
consistent

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 18


Typical DBMS Functionality
 Other features:
 Protection or Security measures to prevent
unauthorized access
 “Active” processing to take internal actions on data
 Presentation and Visualization of data
 Maintaining the database and associated
programs over the lifetime of the database
application

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 19


Example of a Database
(with a Conceptual Data Model)
 Mini-world for the example:
 Part of a UNIVERSITY environment.
 Some mini-world entities:
 STUDENTs
 COURSEs
 SECTIONs (of COURSEs)
 (academic) DEPARTMENTs
 INSTRUCTORs

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 20


Example of a Database
(with a Conceptual Data Model)
 Some mini-world relationships:
 SECTIONs are of specific COURSEs
 STUDENTs take SECTIONs
 COURSEs have prerequisite COURSEs
 INSTRUCTORs teach SECTIONs
 COURSEs are offered by DEPARTMENTs
 STUDENTs major in DEPARTMENTs

 Note: The above entities and relationships are typically


expressed in a conceptual data model, such as the
ENTITY-RELATIONSHIP data model (see Chapters 2)

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 21


Example of a simple database

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 22


Main Characteristics of the Database
Approach
 Self-describing nature of a database system:
 A DBMS catalog stores the description of a particular
database (e.g. data structures and types)
 The description is called meta-data.
 This allows the DBMS software to work with different
database applications.
 Insulation between programs and data:
 Called program-data independence.
 Allows changing data structures and storage organization
without having to change the DBMS access programs.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 23


Example of a simplified database catalog

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 24


Main Characteristics of the Database
Approach (continued)
 Data Abstraction:
 A data model is used to hide storage details
and present the users with a conceptual view
of the database.
 Programs refer to the data model constructs rather
than data storage details
 Support of multiple views of the data:
 Each user may see a different view of the
database, which describes only the data of
interest to that user.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 25


Main Characteristics of the Database
Approach (continued)
 Sharing of data and multi-user transaction
processing:
 Allowing a set of concurrent users to retrieve
from and to update the database
 Care is needed to avoid interferences
 Concurrency control within the DBMS guarantees
that each transaction is correctly executed or
aborted
 Recovery subsystem ensures each completed
transaction has its effect permanently recorded in
the database

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 26


Database Users
 Users may be divided into
 Those who actually use and control the database content, and

those who design, develop and maintain database


applications (called “Actors on the Scene”), and
 Those who design and develop the DBMS software and related

tools, and the computer systems operators (called “Workers


Behind the Scene”).

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 27


Database Users
 Actors on the scene
 Database Designers:

 Responsible to define the content, the structure, the constraints,


and functions or transactions against the database. They must
communicate with the end-users and understand their needs.
 Database administrators:
 Responsible for authorizing access to the database, for
coordinating and monitoring its use, acquiring software and
hardware resources, controlling its use and monitoring efficiency of
operations.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 28


Categories of End-users
 Actors on the scene (continued)
 End-users: They use the data for queries, reports and some of

them update the database content.


 End-users can be categorized into:

 Casual: access database occasionally when needed.


 Naïve or Parametric: they make up a large section of the end-user
population.
 They use previously well-defined functions against the database.
 Examples are bank-tellers or university secretaries who do this activity
for an entire shift of operations.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 29


Categories of End-users (continued)
 Sophisticated:
 These include business analysts, scientists, engineers, others
thoroughly familiar with the system capabilities.
 Many use tools in the form of software packages that work closely with
the stored database.
 Stand-alone:
 Mostly maintain personal databases using ready-to-use packaged
applications.
 An example is a scientists that creates a database for its own
experiments.
 You may become sophisticated or stand-alone users

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 30


Advantages of Using the Database
Approach
 Interacting easily with data using high-level dedicated tools
 Controlling redundancy in data storage and in development and
maintenance efforts.
 Sharing of data among multiple users.

 Ensuring consistency of data.

 Restricting unauthorized access to data.


 Providing storage structures (e.g. indexes) for efficient query
processing

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 31


Advantages of Using the Database
Approach (continued)
 Providing backup and recovery services.
 Providing multiple interfaces to different classes of users.
 Representing complex relationships among data.
 Enforcing integrity constraints on the database.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 32


Additional Implications of Using the
Database Approach
 Potential for enforcing standards:
 This is very crucial for the success of database applications in

large organizations. Standards refer to data item names, display


formats, screens, report structures, Web page layouts, etc.
 Reduced application development time:
 Applications can exploit the high-level database primitives.

 Incremental time to add each new application is reduced.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 33


Additional Implications of Using the
Database Approach (continued)
 Flexibility to change data structures:
 Database structure may evolve as new requirements are defined.

 Availability of current information:


 Extremely important for on-line transaction systems such as

airline, hotel, car reservations.


 Economies of scale:
 Wasteful overlap of resources and personnel can be avoided by

consolidating data and applications across departments.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 34


Historical Development of Database
Technology
 Early Database Applications:
 The Hierarchical and Network Models were introduced in mid

1960s and dominated during the seventies.


 A bulk of the worldwide database processing still occurs using

these models, particularly, the hierarchical model.


 Relational Model based Systems:
 Relational model was originally introduced in 1970, was heavily

researched and experimented within IBM Research and


several universities.
 Relational DBMS Products emerged in the early 1980s.

 Most of the systems used today are based on it (Microsoft

Access, MySQL, …)

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 35


Historical Development of Database
Technology (continued)
 Object-oriented and emerging applications:
 Object-Oriented Database Management Systems (OODBMSs)

were introduced in late 1980s and early 1990s to serve the


need of complex data processing in CAD and other applications.
 Their use has not taken off much.
 Many relational DBMSs have incorporated object database
concepts, leading to a new category called object-relational
DBMSs (ORDBMSs)
 Extended relational systems add further capabilities (e.g. for
multimedia data, XML, and other data types)

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 36


Historical Development of Database
Technology (continued)
 Data on the Web and E-commerce Applications:
 Web contains data in HTML (Hypertext markup language) with

links among pages.


 This has given rise to a new set of applications and E-

commerce is using new standards like XML (eXtended Markup


Language).
 Script programming languages such as PHP and JavaScript

allow generation of dynamic Web pages that are partially


generated from a database.
 Also allow database updates through Web pages

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 37


Extending Database Capabilities
 New functionality is being added to DBMSs in the following areas:
 Scientific Applications

 XML (eXtensible Markup Language)

 Image Storage and Management

 Audio and Video Data Management

 Data Warehousing and Data Mining

 Spatial Data Management

 Time Series and Historical Data Management

 The above gives rise to new research and development in


incorporating new data types, complex data structures, new
operations and storage and indexing schemes in database systems.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 38


When not to use a DBMS
 Main inhibitors (costs) of using a DBMS:
 High initial investment and possible need for additional
hardware.
 Overhead for providing generality, security, concurrency
control, recovery, and integrity functions.
 When a DBMS may be unnecessary:
 If the database and applications are simple, well defined,
and not expected to change.
 If there are stringent real-time requirements that may not be
met because of DBMS overhead.
 If access to data by multiple users is not required.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 39


When not to use a DBMS
 When no DBMS may suffice:
 If the database system is not able to handle the
complexity of data because of modeling limitations
 If the database users need special operations not
supported by the DBMS.

Copyright © 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1- 40

You might also like