Data Warehousing
Data Warehousing
Data Warehousing
2
Data Warehousing: Process Overview
Organizations continuously collect data, information, and knowledge at an
increasingly accelerated rate and store them in computerized systems
3
Benefits of a Data Warehouse
Direct benefits of a data warehouse
Allows end users to perform extensive analysis
Allows a consolidated view of corporate data
Better and more timely information
Enhanced system performance
Simplification of data access
Indirect benefits result from end users using these direct benefits
Enhance business knowledge
Present competitive advantage
Enhance customer service and satisfaction
Facilitate decision making
Help in reforming business processes
4
Data Warehousing Characteristics
Basic (Inmon)
Subject oriented
Integrated
Time variant (time series)
Nonvolatile
Extra
Relational/multidimensional
Real-time
5
Types of data warehouse
Data Marts
Metadata
Data about data. In a data warehouse, metadata describe the contents of a
data warehouse and the manner of its use
7
Data Marts
8
ODS
Operational data stores (ODS) is a type of database
often used as an interim area for a data warehouse,
especially for customer information files
9
Data Warehousing Components
Middleware tools
software that acts as a bridge between an operating system or
database and applications, especially on a network.
10
Data Warehousing Architectures
Three parts of the data warehouse
Data acquisition (back-end) software that extracts data from legacy systems and
external sources, consolidates and summarizes them, and loads them into the data
warehouse
The data warehouse that contains the data and associated software
Client (front-end) software that allows users to access and analyze data from the
warehouse
11
EDW Architecture
12
Hub-and-spoke Architecture
14
The Problem
Operational Systems Decision Makers
Accts. Payable
Accts. Receivable Marketing
Payroll Sales
Purchasing Operations
Order Fulfillment
Inventory
Manufacturing
Inventory … Call Center …
Accts. Payable
Accts. Receivable Marketing
17
Data Integration and the Extraction,
Transformation, and Load (ETL) Process
Data integration
Integration that comprises three major processes:
1. Data access: the ability to access data from the source
2. Data federation: the integration of business views data
from various sources
3. Change capture: capture and delivery of the changes
made to the data sources
19
Data Integration and the Extraction,
Transformation, and Load (ETL) Process
20
Data Warehouse Development
21
DW Development best practice
Project must fit corporate strategy and business objectives
There must be complete buy-in to the project by executives, managers,
and users
It is important to manage user expectations about the completed project
The data warehouse must be built incrementally
The project must be managed by both IT and business professionals
Develop a business/supplier relationship
Only load data that have been cleansed and are of a quality understood
by the organization
Do not overlook training requirements
22
Real-Time Data Warehousing
Real-time (active) data warehousing
The process of loading and providing data via a data warehouse as
they become available
23
Real-Time Data Warehousing
24
Real-Time Data Warehousing
The need for real-time data
Business often cannot afford to wait a whole day for its operational data to load into the
data warehouse for analysis
Provides incremental real-time data showing every state change
Less costly to develop, maintain, and secure one huge data warehouse so that data are
centralized for BI/BA tools
1. Financial strength
2. ERP linkages
3. Qualified consultants
4. Market share
5. Industry experience
6. Established partnerships
26