Ch6 - Data Warehouse in The The Age of Big Data
Ch6 - Data Warehouse in The The Age of Big Data
Ch6 - Data Warehouse in The The Age of Big Data
INTEGRATION
Lecturer: Nguyễn Văn Hồ, M.A.
Data Warehouse and Integration:
Data Warehouse in the the age of Big data
Nguyễn Văn Hồ, M.A.
[email protected]
Data Warehouse and Integration
De-normalized table structure (few tables, Normalized table structure (many tables,
many columns per table) few columns per table)
Processing 1 Source
1
record at a time
Data is highly “normalized”, i.e. Source
1
Source
involve many table joins 2
Source
32
Data Warehouse and Integration
Self- Agile,
Real-Time
Service Nimble
Reporting
BI Solutions
Data Warehouse and Integration
Achieving Extensibility in a DW
Design with change in mind. Ex: Create a lookup table with
code/descriptions, or implement in a view, rather than hard-coding in ETL.
Achieving Extensibility in a DW
Be prepared to do some refactoring along the way. Ex: converting an
attribute to be a conformed dimension.
DimCustomer
First implementation: FactSalesInvoice
CustomerName
CustomerRegion
…
Achieving Extensibility in a DW
Introducing new measures:
• Can be a new column in a fact table as long as it’s the same grain & the
same business process
Agility for the things that usually require the most time investment:
• Data modeling
• ETL processes
• Data quality
Data Warehouse and Integration
Achieving Extensibility in a DW
Reusability Downstream Speed of Change
Implemented
DW OLAP Reports