Business Intelligence: Data Warehouse
Business Intelligence: Data Warehouse
Business Intelligence: Data Warehouse
Data warehouse
What is Business Intelligence?
Collection of data
Data integration
Storage of data
Data analysis
Distribution of data
Reaction based on insights
What is OLAP?
Online Analytical Processing (OLAP) is a category of software
that allows users to analyze information from multiple
database systems at the same time. It is a technology that
enables analysts to extract and view business data from
different points of view.
3 Data It manages all insert, update and It is mainly used for data reading
Modification delete transaction
5 Normalizatio Tables in OLAP database are not Tables in OLTP database are
n normalized. normalized.
example, there are three different application labeled A, B and C.
Information stored in these applications are Gender, Date, and
Balance. However, each application's data is stored different way.
In Application A gender field store logical values like M or F
In Application B gender field is a numerical value,
In Application C application, gender field stored in the form of a
character value.
Same is the case with Date and balance
However, after transformation and cleaning process all this data is
stored in common format in the Data Warehouse.
What is Data Mart?
A Data Mart is focused on a single functional area of an
organization and contains a subset of data stored in a Data
Warehouse. A Data Mart is a condensed version of Data
Warehouse and is designed for use by a specific
department, unit or set of users in an organization. E.g.,
Marketing, Sales, HR or finance. It is often controlled by a
single department in an organization.
Dependent data mart with operational data
store: a three-level architecture
Dependent Data Mart
Independent Data Mart
Hybrid Data Mart:
Data Warehouse - Schemas
• Star Schema
• Snowflake Schema
• Galaxy Schema
Star Schema
Star Schema in data warehouse, in which the center of the star
can have one fact table and a number of associated dimension
tables. It is known as star schema as its structure resembles a
star. The Star Schema data model is the simplest type of Data
Warehouse schema.
• Every dimension in a star schema is represented
with the only one-dimension table.
• The dimension table should contain the set of
attributes.
• The dimension table is joined to the fact table
using a foreign key
• The dimension table are not joined to each other
• Fact table would contain key and measure
• The Star schema is easy to understand and
provides optimal disk usage.
• The dimension tables are not normalized. For
instance, in the above figure, Country_ID does not
have Country lookup table as an OLTP design
would have.
• The schema is widely supported by BI Tools
Fact table
In data warehousing, a fact table consists of
the measurements, metrics or facts of a
business process. It is located at the center of
a star schema or a snowflake schema
surrounded by dimension tables. Where
multiple fact tables are used, these are
arranged as a fact constellation schema.
Snowflake Schema in data warehouse is a logical
arrangement of tables in a multidimensional
database such that the ER diagram resembles a
snowflake shape. A Snowflake Schema is an
extension of a Star Schema, and it adds additional
dimensions. The dimension tables are normalized
which splits data into additional tables.
A Galaxy Schema contains two fact table that
share dimension tables between them. The
schema is viewed as a collection of stars hence
the name Galaxy Schema.
Data Warehouse applications are designed to support the user ad-hoc data
requirements, an activity recently dubbed online analytical processing (OLAP). These
include applications such as forecasting, profiling, summary reporting, and trend
analysis