Вы находитесь на странице: 1из 84

THE ARCHITECTURE OF BI

1
A High-Level Architecture of BI
Data Warehouse Business Analytics Performance and
Environment Environment Strategy
Data Technical staff Business users Managers / executives
Sources Built the data warehouse Access
Data
ü Organizing Warehouse BPM strategy
ü Summarizing Manipulation
ü Standardizing Results

User Interface
Future component - browser
intelligent systems - portal
- dashboard

2
The Architecture of BI
• A BI system has four major components
1. A data warehouse, with its source data
2. Business analytics, a collection of tools for manipulating,
mining, and analyzing the data in the data warehouse
3. Business performance management (BPM) for monitoring
and analyzing performance
4. A user interface (e.g., dashboard)

3
Components in a BI Architecture
1. Data Warehouse
• Cornerstone of any medium-to-large BI
system.
• Originally, only historical data
• Today access to current data as well, so
they can provide real-time decision
support

4
Components in a BI Architecture
2. Business analytics
tools and techniques that help the user transform
data into knowledge
1. Reports and Queries
» Static, dynamic, multidimensional, drill-down
2. Advanced Analytics
 Financial data visualization , Web Analytics, GIS
3. Data, Text, Web mining and other sophisticated
mathematical/ statistical modeling
» Cross sell / up sell
» forensic audit in case of high profile wilful defaulters

5
6
Components in a BI Architecture
3. Business performance management (BPM) /CPM –
Corporate Performance Mgmt
An advanced performance measurement and
analysis approach that embraces planning and
strategy
– BPM extends the monitoring, measuring, and comparing
of sales, profit, cost, profitability, and other performance
indicators by introducing the concept of management and
feedback
– BPM provides a top-down enforcement of corporate-wide
strategy
7
Components in a BI Architecture
4. User Interface (i.e., dashboards)
• provide a comprehensive graphical/pictorial view of
corporate performance measures, trends, and
exceptions.

8
Data Warehousing

Definition and Concepts , Process


overview

9
9
A Simple Definition
A data warehouse is a collection of
data created to support decision-
making applications.

10
10
Data warehouse
A physical repository where relational data are
specially organized to provide enterprise-wide,
cleansed data in a standardized format

11
11
Data Warehouse Characteristics
• Subject oriented
– data -organized around sales, products, employees , etc.
• Integrated
– data - integrated to provide a comprehensive view
• Time variant (think time series)
– historical data is maintained
• Nonvolatile
– data is not updated by users

12
12
13
13
14
14
Three Types of Data Warehouse
• Data Marts
• Operational data stores (ODSs)
• Enterprise data warehouses (EDWs).

15
15
Data Mart
A departmental small-scale “DW” that
stores only limited/relevant data
 Dependent data mart
A subset that is created directly from a data
warehouse
 Independent data mart
A small data warehouse designed for a
strategic business unit or a department
16
16
Types of Datawarehouses
Enterprise Data Warehouse -
Operational data stores (ODS) EDW
• A type of database often • A data warehouse for the
used as an interim staging enterprise.
area for a data warehouse • Large scale
• CRM,SCM,BPM
• Short term decision making
–mission critical
applications

17
Other DW Components
Metadata
Data about data. In a data warehouse, metadata
describe the contents of a data warehouse and the
manner of its acquisition and use

18
18
19
19
20
Source: Inmon
20
Data Warehouse Vs. Data
Warehousing

21
21
A Generic DW Framework
No data marts option
Data Applications
Sources (Visualization)
Access
Routine
ERP Business
ETL
Reporting
Process Data mart
(Marketing)
Select

/ Middleware
Legacy Metadata Data/text
Extract mining
Data mart
(Engineering)
Transform Enterprise
POS Data warehouse
OLAP,
Integrate

API
Data mart Dashboard,
(Finance) Web
Other Load
OLTP/wEB
Replication Data mart
(...) Custom built
External
applications
data

22
22
Components of the Data Warehousing
Process
1. Data Sources 4. Comprehensive
– Legacy, External Data database -EDW
Providers 5. Metadata
2. Data extraction and 6. Middleware Tools
transformation – Enable access to the DW
– custom-written or – e.g. BO
commercial software
called ETL
3. Data loading

23
The Extraction, Transformation, and Load
(ETL) Process

Packaged Transient
application data source

Data
warehouse

Legacy
Extract Transform Cleanse Load
system

Data mart
Other internal
applications

24
24
Corrected Data
Parsed Data First Name: Beth
First Name: Beth Middle Name: Christine
Middle Name: Christine Last Name: Parker
Last Name: Parker Title: SLS MGR
Title: SLS MGR Firm: Regional Port Authority
Firm: Regional Port Authority Location: Federal Building
Location: Federal Building Number: 12800
Number: 12800 Street: South Butler Drive
Street: Lake Calumet City: Chicago
City: Hedgewisch State: IL
State: IL Zip: 60633
Zip+Four: 2398

25
Standardisation

26
Matching

27
Consolidation
Account No.
83451234 Policy No.
ME309451-2

Transaction
B498/97
Account No.
83451234 Policy No.
ME309451-2

Transaction
B498/97 28
Data Warehousing Architecture

29
29
DW Architecture
• Three-tier architecture
1. Data acquisition software (back-end)
2. The data warehouse that contains the data & software
3. Client (front-end) software that allows users to access and analyze
data from the warehouse
4. Easy to create datamarts
5. Functionally separate
• Two-tier architecture
First two tiers in three-tier architecture is combined into one
… sometimes there is only one tier?

30
30
DW Architectures

Tier 1: Tier 2: Tier 3:


Client workstation Application server Database server

Tier 1: Tier 2:
Client workstation Application & database server

31
31
REPRESENTATION OF DATA IN DW
X axis (periods 2001, 2003, and 2003), the Y axis (regions), and the 33
33
Z axis (products--front to back).
Data – Dimension

34
34
Dimensions

• What is a dimension ?
– Dimensions are perspectives with respect to
which an organization wants to keep record.
– products, salespeople, market segments, business
units, geographical locations, distribution
channels, country, or industry
• Time is usually an important dimension by default

35
Measures
• Measures are used to report the values of the
particular variable with respect to a given set
of dimensions.
 money, sales volume, head count, inventory profit, actual versus
forecast

36
Multidimensionality
The ability to organize, present, and analyze data by
several dimensions, such as sales by region, by product,
by salesperson, and by time (four dimensions)
• Multidimensional presentation
– Dimensions: products, salespeople, market segments,
business units, geographical locations, distribution channels,
country, or industry
– Measures: money, sales volume, head count, inventory
profit, actual versus forecast
– Time: daily, weekly, monthly, quarterly, or yearly
Representation of Data in DW
• Dimensional Modeling
– A retrieval-based system that supports high-volume query
access
• Star schema
– The most commonly used and the simplest style of
dimensional modeling
– Contain a fact table surrounded by and connected to
several dimension tables
• Snowflakes schema
– An extension of star schema where the diagram resembles
a snowflake in shape
Example of Star Schema
time
time_key item
day item_key
day_of_the_week Sales Fact Table item_name
month brand
quarter time_key type
year supplier_type
item_key
branch_key
branch location
location_key
branch_key location_key
branch_name units_sold street
branch_type city
dollars_sold province_or_street
country
avg_sales
Measures
39
39
Han: Data Cubes 39
Example of Snowflake Schema

time
time_key item
day item_key supplier
day_of_the_week Sales Fact Table item_name supplier_key
month brand supplier_type
quarter time_key type
year item_key supplier_key

branch_key
branch location
location_key
location_key
branch_key
units_sold street
branch_name
city_key city
branch_type
dollars_sold
city_key
avg_sales city
province_or_street
Measures country
40
40
ACCESSING DATA IN DW
Why Data Warehouse ?
• Everybody stores data in a database..

• Why should the organization have a data


warehouse ?

42
42
Data and Decisions
• Database • Data warehouse
– Throughput high for – Analysis / Analytics
transactional data.. – Higher order decision making
– IRCTC – IRCTC
• Who has booked tickets in the • Which route can benefit using
tatkal pricing ?
last week ?
• Which route is optimum for freight
– ICICI Bank carriers
• Does the Kelambakkam ATM – MakeMyTrip
have enough cash ? • Who can I cross sell an
international holiday to
– ICICI Bank
• What is the frequency at which
ATMs in different locations need to
be loaded?
• Optimal Routes
• Fraud Protection

43
43
Data and Decisions
• Apollo hospitals
– Demographic Profile of patients in the south
Chennai hospitals
– Geography and Asthma relationship
• How can the Government of India use a
datawarehouse ?

44
44
45
OLAP vs. OLTP
OLAP Operations

47
47
1. Slice
Slice is a subset of a
multidimensional array

Usually 2D

48
48
A 3-dimensional
OLAP cube with Sales volumes of

OLAP
slicing
operations
a specific Product
on variable Time
and Region

Slicing
Operations on a

e
m
Ti
Simple Three- Product

Dimensional

Geography
Cells are filled
Sales volumes of
Data Cube
with numbers
representing a specific Region
sales volumes on variable Time
and Products

Sales volumes of
a specific Time on
variable Region
and Products

49
49
2. Dice

Slice on more
than two
dimensions of a
data cube

50
50
Drill down

From higher level summary


to lower level summary
Detailed data, or
introducing new
dimensions

51
51
Roll up

Summarize data by
climbing up hierarchy or
by dimension reduction

52
52
Pivot
It is used to
change the
dimensional
orientation of a
report or an ad
hoc
53
53
Business Performance
Management

54
A High-Level Architecture of BI
Data Warehouse Business Analytics Performance and
Environment Environment Strategy
Data Technical staff Business users Managers / executives
Sources Built the data warehouse Access
Data
ü Organizing Warehouse BPM strategy
ü Summarizing Manipulation
ü Standardizing Results

User Interface
Future component - browser
intelligent systems - portal
- dashboard

55
Business Performance Management
(BPM) Definition
• What is it ?
– Framework that helps organizations deliver critical
insight to improve financial and operational
performance
• What does the framework specify ?
– Framework for organizing, automating, and
analyzing business methodologies, metrics,
processes, and systems

56
Business Performance Management
(BPM) Overview
• BPM and BI Compared
– BPM is an outgrowth of BI and incorporates many
of its technologies, applications, and techniques
– BPM is an enterprisewide strategy that seeks to
prevent organizations from optimizing local
business at the expense of overall corporate
performance
– BPM is part of the daily work of managers

57
Major BPM Processes
• Strategize
• Setting goals and objectives
• Plan
– Establishing initiatives and plans to achieve those
goals
• Monitor
– Monitoring actual performance against the goals
and objectives
• Act and Adjust
– Taking corrective action
58
Business Performance Management
(BPM) Overview
Strategize:
Where Do We Want to Go?
• Strategic planning
– Tasks common to the strategic planning process:
1. Conduct a current situation analysis
2. Determine the planning horizon
3. Conduct an environment scan
4. Identify critical success factors
5. Complete a gap analysis
6. Create a strategic vision
7. Develop a business strategy
8. Identify strategic objectives and goals
Strategize:
Where Do We Want to Go?
• Strategic planning
– Strategic objective
A broad statement or general course of action
prescribing targeted directions for an organization
– Strategic goal
A quantified objective with a designated time
period
Strategize:
Where Do We Want to Go?
• Strategic planning
– Strategic vision
– A picture or mental image of what the
organization should look like in the future
– Critical success factors (CSF)
Key factors that delineate the things that an
organization must excel at to be successful in its
market space
Strategize:
Where Do We Want to Go?
• The strategy gap
– Four sources for the gap between strategy and
execution:
1. Vision
2. People
3. Management
4. Resources
Plan:
How Do We Get There?
• Operational planning
– Operational plan
Plan that translates an organization’s strategic
objectives and goals into a set of well-defined
tactics and initiatives, resources requirements,
and expected results
Plan:
How Do We Get There?
• Operational planning
– Tactic-centric plan—tactics are established to
meet the objectives and targets established in the
strategic plan (used by best practices
organizations
– Budget-centric plan—a financial plan or budget is
established that sums to the targeted financial
values
Plan:
How Do We Get There?
• Financial planning and budgeting
– An organization’s strategic objectives and key
metrics should serve as top-down drivers for the
allocation of an organization’s tangible and
intangible assets
– Resource allocations should be carefully aligned
with the organization’s strategic objectives and
tactics in order to achieve strategic success
Monitor:
How Are We Doing?
• A comprehensive framework for monitoring
performance should address two key issues:
– What to monitor
– How to monitor
Monitor:
How Are We Doing?
• Diagnostic control system
– Inputs, a process for transforming inputs into
outputs, a standard against which to compare
these outputs, and a feedback channel allowing
information on variances to be communicated so
that it can be acted upon.
• Management by exception
– Don’t have to constantly monitor
– Scheduled and Exception reports

68
Monitor:
How Are We Doing?
Business
Strategy

Strategic
Objective

Strategic
Goal

Business Business Business Variance


Input Processes Output Analysis

Feedback

Figure 9.2 Diagnostic Control System


Monitor:
How Are We Doing?
• Pitfalls of variance analysis
– The vast majority of the exception analysis focuses
on negative variances when functional groups or
departments fail to meet their targets
– Rarely are positive variances reviewed for
potential opportunities, and rarely does the
analysis focus on assumptions underlying the
variance patterns
Act and Adjust:
What Do We Need to Do Differently?
• Hackett Group’s benchmarking process
divides planning and management reporting
into four subprocesses:
1. Strategic planning
2. Operational and financial planning
3. Reporting
4. Forecasting
Performance Dashboards
• Dashboards / Scorecards
– provide visual displays of important information
– consolidated and arranged on a single screen
– information can be digested at a single glance and
easily explored
Performance Dashboards
Scorecard
Major differences between a scorecard
and a dashboard
Characteristic Dashboard Scorecard
Purpose Measures Charts progress
performance
Users Supervisors, Executives,
specialists managers, staff
Updates “Right-time” Periodic
feeds snapshots
Data Events Summaries
Display Visual Visual graphs,
graphs, raw text comments
data
Tactical and Strategic Dashboards
• Operational Dashboard • Tactical dashboard
– frontline workers and – Managers and analysts
supervisors – track detailed and
– monitor key operational summarized data
data that are lightly – departmental processes
summarized and projects on a daily
– Updated frequently or weekly basis.
throughout the day.
Strategic Dashboard
• Strategic dashboard
– used by executives, managers, and staff
– monitor detailed and summarized data
– strategic objectives on a monthly or quarterly
basis.
Planning your Dashboard
• Research your requirements:
– Dashboard Purpose
– Audience
– KPI’s (Key Performance Indicators)
– Time Period
– Data Source
– Frequency & Delivery
Performance Measurement Systems
• Balanced Score Card (BSC)
– organization’s financial, customer, internal process,
and learning and growth objectives and targets
into a set of actionable initiatives
– Why is it “balanced”?
• Inclusion of non-financial measures
BPM Methodologies

82
BPM Methodologies

Strategy map
A visual display that delineates
the relationships among the key
organizational objectives for all four
BSC perspectives

83
BPM Methodologies
• Six Sigma
– More of a “process improvement” model
– The DMAIC performance model
A closed-loop business improvement model that
encompasses the steps of defining, measuring,
analyzing, improving, and controlling a process

Вам также может понравиться