BI Analytics Overview
BI Analytics Overview
BI Analytics Overview
net/publication/327578485
CITATIONS READS
0 7,586
1 author:
Jack G Zheng
Kennesaw State University
41 PUBLICATIONS 343 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Jack G Zheng on 12 January 2020.
IT 4713/6713 BI
Jack G. Zheng
Spring 2020 (since V1 2012)
http://jackzheng.net/teaching/it4713/
http://jackzheng.net/teaching/it6713/
https://www.researchgate.net/publication/327578485
https://www.edocr.com/v/r4dg6mjr/
Overview
This lecture notes provides a high level overview of business
intelligence and analytics. This overview is comprehensive and covers
as many aspects as possible, but it keeps them at a high level. More
details are provided in more learning modules.
2
Types of Information Processing
For a more detailed comparison of OLTP and OLAP:
http://www.slideshare.net/fmhyudin/oltp-vs-olap-23317601
Transactional Processing
• Focus on data item processing
(data insertion, modification,
deletion), transmission, and Analytical Processing
non-analytical query • Focus on reporting, analysis,
transformation, and decision
support
3
DIKW
• The DIKW hierarchy depicts relationships between data, information,
knowledge (and wisdom).
– Data: raw value elements or facts
– Information: the result of collecting and organizing data that provides context and meaning
– Knowledge: the concept of understanding information that provides insight to information,
thus useful and actionable
Analytical
Processing
Transactional
Processing
• What if analysis
– If inventory levels are reduced by 10%, what is the new cost of inventory
storage?
• Fuzzy decision
– What new advertising strategies need to be undertaken to reach our
customers who can afford a high priced product?
– Should we invest more on our e-business?
5
What is Business Intelligence?
Business Intelligence is a set of methods,
processes, architectures, applications, and
technologies that gather and transform raw
data into meaningful and useful information
used to enable more effective strategic,
tactical, and operational insights and
decision-making.
Adapted from Forrester Report
“Topic Overview: Business Intelligence”, 2008
https://www.forrester.com/report/Topic+Overview+Business+Intelligence/-/E-RES39218
More BI from Forrester
https://www.forrester.com/business-intelligence
6
Data
• Different types of data
– Numeric vs. textual
– Structured vs. unstructured
– Standard format vs. proprietary format
– Internal vs. external data, system stored vs. file based data
– Raw fact data vs. simulated/forecast/estimated data
– Simple fact data vs. calculated metrics data
7
Decision Making
• Decisions can be made based on
– Facts, or data
– Simulation (models)
– Intuition, perception, sense
– Group negotiation
8
Additional Notes about BI
• BI is the an umbrella term for a set of methods, processes, applications, and
technologies used to
– gather, provide access to, analyze, and report data and information
– support understanding and decision making
– A common goal in BI is to drive performance
9
Evolution of BI
1980s Executive information systems (EIS), decision support systems (DSS)
The search for the perfect “business insight system”, from Performance Dashboard, by Wayne
Eckerson http://download.101com.com/pub/tdwi/files/performancedashboards.pdf
10
Analytics Depending on perspectives, Analytics can
• Analytics has emerged as a catch-all term for a • include BI
variety of different business intelligence (BI) and • be part of BI
application-related initiatives. … Whatever the use
cases, “analytics” has moved deeper into the • = (the new) BI
business vernacular.
– https://www.gartner.com/it-glossary/analytics/
11
Analytics or BI
• We tend to call analytics rather than BI in the
following scenarios. But their processes and
technologies are very similar.
• Non-business activities such as
– Learning analytics
– Talent analytics
– Web analytics
– Sports analytics
• Data science
– An interdisciplinary field about processes and systems to extract knowledge or insights from data in various forms
– Focus on advanced analytics and presentation models and methods
– Using autonomous or semi-autonomous techniques and tools, typically beyond traditional BI to discover deeper insights,
make predictions, or generate recommendation.
– A good data scientist = data hacker + programmer+ analyst+ coach+ story teller+ artist (http://analyticsindiamag.com/data-
science-the-most-desirable-job-in-the-21st-century/)
– “In some ways, data science is an evolution of BI.” https://www.linkedin.com/pulse/data-science-business-intelligence-whats-
difference-david-rostcheck/
• All these new terms try to differentiate them from the (traditional) BI. However, if one considers BI is a dynamic
and evolving field, then all these new terms are just extensions/expansions of BI; they all still fall under the
umbrella of the general BI.
– “In its more comprehensive usage, BI is all of the systems, platforms, software, technology, and techniques that are
essential for the collection, storage, retrieval, and analysis of data assets within a given organization.” – Dataversity 2015
Report on BI vs Data Science
13
BI/Analytics: A General Process
Data can be The process involves analytical
analyzed components, such as
immediately in dimensional analysis, statistical Results are presented
many agile and delivered in different
The organization and analytical cases, analysis, data mining, and other human comprehendible
transformation of data without a formal advanced analytics to extract formats, to support
into clean and common managed storage. information and knowledge. decisions. It also
models and formats. includes data exploration
and reporting.
Data Preparation
Queries can also directly
The collection of raw present results to users
data from different The refined data will be modeled
sources by different (if needed) and stored in a without intensive
means, and in different particular place (e.g., a file or a analysis. This is usually
formats. data management system) and used for data exploration
ready for analysis. and descriptive reports.
14
BI in the Decision Process
Another view from the corporate decision perspective
http://www.slideshare.net/junesungpark/business-process-based-analytics
15
General BI Capabilities Conception
This is consistent with the
general BI or analytics
process but more from an
information behavior angle.
Figure from: Business Intelligence, Rajiv Sabherwal, Irma Becerra-Fernandez, John Wiley & Sons, 2011
http://books.google.com/books?id=T-JvPdEcm0oC
16
BI Systems and Platforms
• A BI system is a computer information system that
implements (part or whole) BI capabilities and processes
• The values of BI Systems
– Provide an integrated data (analytical) processing platform
– Enable easy and fast access of data and information at all
levels (raw data, analysis results, metrics, etc.)
– Streamline a controlled and managed process of data driven
decision making
• Enterprise level vs. personal level
– An enterprise level BI system emphasizes more on control
and performance.
– While a more user-oriented analytics platform enables
nontechnical users to autonomously execute full-spectrum
analytic workflows from data access and preparation to
interactive analysis and the collaborative sharing of insights.
17
BI System Components at a Glance
* Data management
usually includes a
data sourcing and • Performance
gathering management
component. This
component may be • Benchmarking
Data integrated with or Applications • Market research
independent from a
Management: data storage system. • CRM
Gathering and Presentation • Strategic
management
Storage • Web site analytics
18
Critical Capabilities of a BI and Analytics Platform
Gartner Magic Quadrant Report 2018/2019
• Infrastructure
– BI Platform Administration. Capabilities that enable scaling the platform, optimizing performance and ensuring high availability and disaster recovery.
– Cloud BI. Platform-as-a-service and analytic-application-as-a-service capabilities for building, deploying and managing analytics and analytic
applications in the cloud, based on data both in the cloud and on-premises.
– Data Source Connectivity. Capabilities that allow users to connect to the data contained within various types of storage platforms.
• Data Management
– Governance and Metadata Management. Tools for enabling users to share the same systems-of-record semantic model and metadata. These should
provide a robust and centralized way for administrators to search, capture, store, reuse and publish metadata objects, such as dimensions, hierarchies,
measures, performance metrics/key performance indicators (KPIs) and report layout objects, parameters and so on.
– Self-Contained ETL and Data Storage. Platform capabilities for accessing, integrating, transforming and loading data into a self-contained storage
layer, with the ability to index data and manage data loads and refresh scheduling.
– Self-Service Data Preparation. The drag-and-drop, user-driven data combination of different sources, and the creation of analytic models such as
user-defined measures, sets, groups and hierarchies.
– Scalability and Data Model Complexity. The degree to which the in-memory engine or in database architecture handles high volumes of data,
complex data models, performance optimization and large user deployments.
• Sharing of Findings
– Embedding Analytic Content. Capabilities including a software developer's kit with APIs and support for open standards for creating and modifying
analytic content, visualizations and applications, embedding them into a business process, and/or an application or portal. These capabilities can reside
outside the application (reusing the analytic infrastructure), but must be easily and seamlessly accessible from inside the application without forcing
users to switch between systems.
– Publish and collaborate Analytic Content. Capabilities that allow users to publish, deploy and operationalize analytic content through various output
types and distribution methods, with support for content search, storytelling, scheduling and alerts.
19
A Practical System Architecture in MSBI
20
Data Management/Storage
• In traditional BI, a special database system called data warehouse or
data mart is often used to store enterprise data
– The purpose of a data warehouse is to organize lots of stable data for ease
of analysis and retrieval.
• Traditional BI focuses on upfront separate ETL processes that load the data in a
centralized storage. In modern BI and analytics, data cleanse and
transformation may happen just-in-time with analysis.
ETL will be covered in details in
milestone 2 (module 5 and 6).
22
Analysis Techniques
• Descriptive reporting
– Structured and fixed format reports
– Based on simple and direct queries
– Usually involves simple descriptive analysis and transformation of data,
such as calculating, sorting, filtering, grouping, and formatting
– Ad hoc query and reporting
• Business analytics
– Business analytics (BA) is the practice of iterative, methodical exploration of
an organization’s data with emphasis on statistical analysis.
23
OLAP OLAP will be covered in details in
millstone 3 (module 7 and 8).
26
Levels of Analytical Processing
A
27
Presentation is key – be
Presentation a master of PowerPoint.
28
Data Visualization
• Data visualization is the graphical representation and presentation of data for the purpose of
perception and understanding
• Data visualization in BI
– Data visualization is an important part of data exploration and decision making. Given the power of
visualization, it is only natural to apply the rich communication techniques in the field of BI and analytics.
– As organizations seek to empower non‐technical users to make data‐driven decisions, they must
consider the powers of data visualization in delivering digestible insights.
– Visualization tools have become increasingly important to business intelligence, in which people need
technology support to make sense of and analyze complex data sets and all types of information.
– Visualization can also be part of the analysis process (visual analytics)
Data visualization will be touched briefly in this course. For more coverage, take a look at
IT 7113 Data visualization http://zheng.kennesaw.edu/teaching/it7113 and the overview at
https://www.edocr.com/v/yqwmqeba/jgzheng/Business-Data-Visualization
29
Reports
• Reports
– A report is the presentation of detailed data arranged in defined layouts and formats
– Based on simple and direct queries: usually involves simple analysis and transformation of data (sorting, calculating,
filtering, filtering, grouping, formatting, etc.)
• Traditional reports contain detailed data in a tabular format and typically display numbers and text only.
– It is geared towards people who need data rather than a direct understanding or interpretation of data.
– Its purpose is mainly for printing (with styling) or exporting (raw data).
• Modern reports can be interactive and visual but the focus is still on detailed data. Sometimes the distinction is
a bit blurred with dashboards in some practical cases.
– A report style “dashboard” (or more like a visual intensive interactive report):
https://www.cityhealthdashboard.com/ga/atlanta/city-overview
– Magic Quadrant report vs. https://www.g2.com/categories/data-visualization?segment=all
– Dashboard or report? http://www.crazybikes.com/mrc/CRAZYBIKES.R00090s
30
Dashboard
A dashboard is a visual-oriented display of the most
important data and information needed to achieve defined
goals and objectives; consolidated and arranged on a
single screen so the information can be viewed at a glance.
Adapted from: Dashboard Confusion, Stephen Few,
http://www.perceptualedge.com/articles/ie/dashboard_confusion.pdf
• Elements of a dashboard
Dashboard = data/information + visual + UI
– Data/information: the most important element
– Visual: data visuals (charts, etc.) provide an high level at-a-glance view
– User interface
• a clean UI that unifies all elements to work together as a whole
• supporting interactions as needed
32
BI Users
Producers
vs.
Consumers
(at different levels)
33
Users Have Different Needs
34
The Fit between Tools and Users
Gartner Report,
Select the Right Business Intelligence and Analytics Tool for the Right User
Published: 23 May 2016 Analyst(s): Cindi Howson
35
BI Trends
http://www.b-eye-
36 network.com/blogs/eckerson/archives/2011/03/bi_market_evolu.php
The Modern/New BI
• A modern BI platform supports IT-enabled analytic content development. It is defined by a self-
contained architecture that enables nontechnical users to autonomously execute full-spectrum
analytic workflows from data access, ingestion and preparation to interactive analysis and the
collaborative sharing of insights. It moves from passive collection and use of data (reporting
driven) to proactive generation of data (business development driven).
• https://www.slideshare.net/Dataversity/analytics-business-intelligence-and-data-science-whats-
the-progression
Technology Insight for Modern Business Intelligence and Analytics Platforms
Gartner Report, October 2015
Analytic Workflow Component Traditional BI Platform Modern BI Platform
Data source Upfront dimensional modeling required (IT-built Upfront modeling not required (flat
star schemas) files/flat tables)
Content authoring Primarily IT staff, but also some power users Business users;
Analysis Predefined and regular reporting, based on Free-form exploration, ad hoc analytics
predefined model
Insight delivery Distribution and notifications via scheduled Sharing and collaboration, storytelling,
reports or portal; passive collection and use of open APIs
data (reporting driven).
37
Notable Trends/Features of the Modern BI
1. Self-service BI/Analytics: Business led, IT enabled
3. Embedded analytics: use of reporting and analytic capabilities directly in business applications http://www.gartner.com/it-glossary/embedded-
analytics/
4. Search driven analytics: (aka clickless analytics) aims to build a report and charts on the fly, using web search style.
– Incorporating natural language processing
– A quick intro: https://www.youtube.com/watch?v=868-pR-cxZo
5. Augmented analytics: uses machine-learning automation to supplement human intelligence across the entire analytics life-cycle.
• More trends
– https://bi-survey.com/top-business-intelligence-trends
– http://www.zdnet.com/article/is-the-business-intelligence-market-finally-maturing/
– https://www.slideshare.net/TableauSoftware/top-10-business-intelligence-trends-for-2017
– https://www.mrc-productivity.com/blog/2019/01/5-business-intelligence-trends-to-watch-in-2019/
38
Self-Service BI
• [A solution for] end users designing and deploying their own reports and analyses within an approved and
supported architecture and tools portfolio.
– http://www.gartner.com/it-glossary/self-service-business-intelligence
• Key features
– Shifting focus from IT back to user: enables all kinds of users with varied skill levels to autonomously execute full-spectrum
analytic workflows. These users include traditional power users, data professionals or data scientists, managers and
business analysts.
– A more distributed and collaborative environment.
– The process is more flexible and agile, and responds to user needs quickly. Supporting ad hoc analytic needs, hence more
interactive and explorative.
– Self-service BI tools still have fundamental BI components and provide BI capabilities, but they are more integrated (in one
software package) than separated.
– Independent but very often work with enterprise systems.
– Good for individuals or non-corporate environments.
• Dashboards, reporting, end-user self-service, and advanced visualization are the top four most important
technologies and initiatives strategic to BI in 2018.
– https://www.forbes.com/sites/louiscolumbus/2018/06/08/the-state-of-business-intelligence-2018/#b2fca2878289
• The global self-service business intelligence market to grow from USD 3963.04 million in 2016 to USD
10992.96 million by 2023, at a CAGR of 15.69%.
– http://www.nbc-2.com/story/38414064/global-self-service-business-intelligence-market-2018-size-share-growth-trends-type-
application-analysis-and-forecast-by-2023
39
IT Support in Self-Service BI
• The goal of self-service BI
– NOT to eliminate the need for IT
– Instead, to put data and results in the user’s hands and reduce the
burden on the IT department.
• “Self-service BI does remove much of the reporting burden from
the IT department. The IT department must control the data and
the user access. They’re responsible for keeping the data clean,
and ensuring that users can only access data they’re authorized
to see. The self-service BI tool only acts as a doorway for users
to access the IT-controlled data.”
– https://www.mrc-productivity.com/blog/2015/08/6-common-
misconceptions-of-self-service-bi/
• IT’s role
– Data management and governance, including security, access
control, data quality and accuracy, compliance, etc.
– Technical support for the systems and platforms, especially cloud
based
40
A Changing BI Platform
41
BI/Analytics Application Areas
• BI/Analytics can be applied in all “businesses” (industries,
functional areas, or domains) to drive “business” performance
– Companies (for profit) and financially related
• Retail, manufacture, real-estate, financial, sports, media, advertising,
entertainment, healthcare, publication, energy, etc.
– Public (non-profit)
• Organization, institution, association, community, etc.
– Government: citizen service, city planning, crime, immigration, etc.
– Personal: personal health, exercise, learning, eating, power
consumption, etc.
• BI can be applied at different levels
– Strategic: focused on high level organizational strategies and
directions
– Tactic: focused on goals of a organization unit
– Operational: focused on streamlining day-to-day operations.
– https://www.business2community.com/business-intelligence/the-
four-sides-of-business-intelligence-
0548311#ycaoYFUR04W76YiY.97
42
Sample BI/Analytics Applications
• Business management • IT management
– Strategic planning – Web analytics
– Performance management – App analytics
– Process intelligence – Security management
– Competitive intelligence
• Supply chain and Logistics
• Marketing and sales – Supplier and vendor management
– CRM – Shipping and inventory control
– Customer behavior analysis
– Targeted marketing and sales • Insurance
strategies • Government
– Customer profiling
– City planning
– Campaign management
– Traffic management
– Inventory management
– Urban Analytics
• Human resource/capital – Power usage
– HR analytics
• Education
– Talent management
– Learning analytics
• Project and program management – Student engagement and success
– Institutional effectiveness
• Power and energy management
• Social analytics
• Healthcare management
• Sports and games analytics
43
BI Market Major vendors
Worldwide Business
Intelligence and
Analytics Tools by
Vendor, 2016-2018
https://www.sas.com/en_u
s/news/press-
releases/2019/october/idc-
number-one-predictive-
analytics-ax19.html
https://www.appsruntheworld.com/
top-10-analytics-and-bi-software-
vendors-and-market-forecast/
Major Vendors/Products
• Mega vendors provide complete solutions that cover full spectrum of BI processes.
– Microsoft: SQL Server, Power BI, SharePoint, Excel
• https://www.microsoft.com/en-us/sql-server/
– SAP: SAP BusinessObjects BI, Lumira
• https://www.sap.com/products/analytics/business-intelligence-bi.html
– IBM: Cognos, Watson
• https://www.ibm.com/analytics/business-intelligence/
– Oracle: Oracle BI 12c
• https://www.oracle.com/solutions/business-analytics/business-intelligence/
– SAS: SAS Enterprise BI
• https://www.sas.com/en_us/solutions/business-intelligence.html
45
Vendor Positioning Notice this year Gartner
put analytics before BI.
https://www.g2.com/categories/business-intelligence-
platforms
46
BI/Analytics Careers
• Typical BI positions
– BI solution architects and integration specialists
– Business and BI analysts
– BI application developers and testers
– BI system support specialists
– Data warehouse specialists
– Database analysts, developers and testers
• BI jobs in Atlanta
– https://www.dice.com/jobs?q=Business+intelligence&l=A
tlanta%2C+Ga+Metro+Area
Critical Knowledge and Skills
• Three competencies
– Technical, Business (management), Analytical
• Technical knowledge
– Knowledge of database systems and data warehousing technologies
– Ability to manage database system integration, implementation and testing
– Ability to manage relational databases and create complex reports
– Knowledge and ability to implement data and information policies, security requirements, and state and federal regulations
– Knowledge of client tools used by business users
– Knowledge of data models
– Knowledge of programming tools used in analytics
• https://www.datapine.com/blog/bi-skills-for-business-intelligence-career/
Sample Roles (from real world job ads)
Business Intelligence Specialist Business Intelligence Developer
• Maintain or update business intelligence tools,
databases, dashboards, systems, or methods. • Business Intelligence Developer is responsible for
• Provide technical support for existing reports, designing and developing Business Intelligence
dashboards, or other tools. solutions for the enterprise.
• Create business intelligence tools or systems, • Key functions include designing, developing, testing,
including design of related databases, spreadsheets, debugging, and documenting extract, transform, load
or outputs. (ETL) data processes and data analysis reporting for
enterprise-wide data warehouse implementations.
• Responsibilities include:
Business Intelligence Analyst
– working closely with business and technical
teams to understand, document, design and
• Technical skill requirements code ETL processes;
– Works with business users to obtain data requirements
for new analytic applications, design conceptual and – working closely with business teams to
logical models for the data warehouse and/or data mart. understand, document and design and code data
– Develops processes for capturing and maintaining analysis and reporting needs;
metadata from all data warehousing components. – translating source mapping documents and
• Business skills requirements reporting requirements into dimensional data
– Transform data into analytical insight and desire to models;
leverage the best technique to arrive at the right answer.
– Generate standard or custom reports summarizing – designing, developing, testing, optimizing and
business, financial, or economic data for review by deploying server integration packages and
executives, managers, clients, and other stakeholders. stored procedures to perform all ETL related
– Analyze competitive market strategies through analysis functions;
of related product, market, or share trends. – develop data cubes, reports, data extracts,
– Collect business intelligence data from available dashboards or scorecards based on business
industry reports, public information, field reports, or
purchased sources. requirements.
– Maintain library of model documents, templates, or other • The Business Intelligence Report Developer is
reusable knowledge assets. responsible for developing, deploying and supporting
reports, report applications, data warehouses and
business intelligence systems.
49
BI/Analytics Education at KSU
• MSIT/BSIT
– IT 6713 Business Intelligence http://jackzheng.net/teaching/it6713/
– IT 7113 Data Visualization http://jackzheng.net/teaching/it7113/
– Certificate on data management and analytics
http://ccse.kennesaw.edu/it/programs/cert-dm.php
• BSIT
– The new concentration on “data analytics and technology” (starting fall
2020)
– IT 4713 Business Intelligence Systems http://jackzheng.net/teaching/it4713/
• Other departments
– Ph.D. in Analytics and Data Science https://datascience.kennesaw.edu
– ACS 8310 Data Warehousing
– IS 8935 Business Intelligence - Traditional and Big Data Analytics
– Certificate in High Performance Cluster Computing
http://ccse.kennesaw.edu/cs/programs/cert-hpcc.php
50
Core Readings
• A quick, more conceptual and practical introduction of BI by Jared Hillam (Intricity):
http://www.youtube.com/watch?v=LFnewuBsYiY
• BI intro video by LearnItFirst (focused more on the traditional BI; there are some good points
which I do agree): https://www.youtube.com/watch?v=LhZX0MAYKp8
• Others
– http://wps.prenhall.com/wps/media/objects/2519/2580469/addit_chmatl/TURBMC04_0131854615App.pd
f
– A Brief History of Decision Support Systems by D.J. Power:
http://dssresources.com/history/dsshistory.html
– Advanced Analytics and Business Intelligence: https://www.youtube.com/watch?v=oNNk9-tmsZY
– History of BI (casual video with wacky visuals): https://www.youtube.com/watch?v=_1y5jBESLPE
– https://www.datapine.com/blog/bi-skills-for-business-intelligence-career/
– https://www.1keydata.com/datawarehousing/datawarehouse.html
51
Good General BI/Analytics Resources
• General BI resource web sites
– BI and DW resource directory: http://www.bi-dw.info
– BeyeNetwork: http://www.b-eye-network.com
– https://solutionsreview.com/business-intelligence/
– DSS Resources: http://dssresources.com/
– ACM techpack: http://techpack.acm.org/bi/
– http://blog.capterra.com/learn-about-business-intelligence-resources/
– https://www.itprotoday.com/business-intelligence
• Paid industry reports: you may get some free reprints from some vendors after registration.
– Gartner annual report on “Magic Quadrant for Analytics and Business Intelligence Platforms”
– Gartner report “Technology Insight for Modern Analytics and Business Intelligence Platforms”
– The Forrester Wave™: Enterprise BI Platforms (two versions, one for on-premise and one for cloud)
– Forrester Playbook: https://www.forrester.com/playbook/The+InsightsDriven+Business+Playbook/-/E-PLA940