Data Warehousing
Data Warehousing
Data Warehousing
nonvolatile updateable
History of DATA
WAREHOUSING
concept dates back to the late
1980s when IBM researchers Barry
Devlin and Paul Murphy developed
"business data warehouse".
In essence, data warehousing
concept was intended to provide an
architectural model for the flow of
data from operational systems
to decision support environments
In absence of a data warehousing
architecture, an enormous amount of
redundancy of information was required to
support the multiple decision support
environments that usually existed.
personal productivity
budgeting,
forecasting,
product line and customer profitability,
sales analysis,
financial consolidations
manufacturing mix analysis
--applications that use historical, projected and derived
data.
Benefits of Data
Warehousing
Has a subject area orientation
Data is arranged by subject rather than by
application, and is more intuitive for users to
navigate.
closer in concept to the way decision makers
think about their business.
Architecture,
in the context of an
organization's data warehousing efforts,
is a conceptualization of how the data
warehouse is built.
METADATA LAYER
The data directory -This is often usually more detailed than an
operational system data directory. There are dictionaries for the entire
warehouse and sometimes dictionaries for the data that can be
accessed by a particular reporting and analysis tool.
Data Warehouse
Components
LOAD MANAGEMENT
relates to the collection of info. from disparate internal or external
sources.
loading process includes summarizing, manipulating and changing data
structures into format that lends itself to analytical processing.
Actual raw data kept alongside or within data warehouse enabling
construction of new and different representations.
WAREHOUSE MANAGEMENT
relates to day-to-day management of data warehouse.
management tasks associated with warehouse includes ensuring its
availability, the effective backup of its contents & security.
QUERY MANAGEMENT
relates to provision of access to contents of warehouse
includes partitioning of info. into different areas with different
privileges to different users.
Access may be provided through custom-built applications, or ad hoc
query tools.
Technologies Involved In Data
Warehousing
Key advantage :
easier for the user to understand and to use.
the retrieval of data from the data warehouse tends to operate very
quickly.
disadvantages
Key advantage
it is straightforward to add information into the database.
disadvantage
because of no. of tables involved, difficult for users to
1) Join data from different sources into meaningful information
2) Access the information without a precise understanding of
sources of data & data structure of the data warehouse.