Вы находитесь на странице: 1из 11

Business Analytics Course

Module 1: Business Analytics (Internal)


Basics of Big Data
Analytics Maturity
What is Data Science
Introduction to Hadoop
Analytical Tools: R vs SAS
What is Machine Learning
Module 2: Tools (Internal)
Introduction To R
o Data Import Techniques
o Analyze and Processing of Data
o Plotting function in R
o Data Sub setting: Indexing
o Functions in R
o Integrating R with Hadoop
Introduction To Tableau
Introduction To Python
SQL Queries
Module 3: Machine Learning (Internal)
Basics of ML: Supervised vs Unsupervised
Use Cases:
o Forecasting
o Classification
o Clustering
o Association Detection
o Anomaly/Outlier Detection
o Dimensionality Reduction
o NLP (Natural Language Processing)
o Image Detection & Deep Learning
Algorithm Basics:
o Regression (Linear & Logistic)
o Bayesian Methods (Nave)
o Nearest Neighbour
o SVM (Support Vector Machine)
o Neural Network & Preceptors
o Decision Tree, Forest & Jungle
o Artificial Neural Network
o Principal Component Analysis
o Time Series Modelling
Algorithm Cheat Sheet (Which Algorithm To Use When)
Module 4: Case Studies (External)

Prediction
o Demand Forecasting
o Price Prediction
Clustering o Market & Customer Segmentation
o Event/Activity based Segmentation
o Topic Modelling

Classification
o SPAM Detection
o Churn Analytics

Anomaly Detection
o Credit Risk Scoring
o Fraud Detection
o High Speed Arbitrage Trading

Association Detectiono Market Basket Analysis


o Next Best Offer Analysis
o Recommendation Engine (Products, Friend , Playlist)
o Cross Sell Model
o Customer Lifetime Value Analysis
o Sentiment Analysis

Dimensionality Reduction
o Genre classification
o Quant Trading

Neural Network
o Social Graph Analysis
o Quant Trading

Natural Language Processing


o Social Graph Analysis
o Topic Modeling
o Text Analysis
o Sentiment Analysis

Other
o
o
o
o
o
o
o
o
o

Cross Channel Analysis


Channel Optimization
Campaign management and Optimization
Discount Targeting
Event/Behaviour Based Targeting
Call Detail Record Analysis
Network Performance Optimization
Dynamic Pricing
Loyalty Program

Module 5: Data Visualization ((Internal)

Introduction
Data Visualization Process
Working with Single Data and Multiple Data Sources
Using Calculations in Tableau
Comparing Measures Against a Goal
Tableau Geo coding, Advanced Mapping
Showing Distributions of Data
Statistics and Forecasting
Dashboard Best Practices
Case Study

Module 6: Big Data & Hadoop (Internal)

Introduction To Hadoop Technology


Basics of Hadoop
HDFS and Cluster Environment
Map Reduce basics, Types and Formats
HIVE/PIG/HBASE/SPARK etc

Case Study 1: Customer


Segmentation
Every customer is different and so is their behaviour. To understand the vast and
diverse universe of the customer base, we need to segment the customers based
on their characteristics. These characteristics could be based on the
requirements, preferences, lifestyles or life stages.

Task 1: Design a model which can segment customer


into different groups
Data: Customer Orders Data_RFM.CSV
High level flow:
1. Calculate R,F and M parameters
2. Calculate Breadth & Tenure
3. Mosiac Plot for R vs F, M vs F, M vs R, Create a balloon Plot and show
different segment
a. Life Stage ( New, Active, Lapsed, Lost)
b. Value ( Gold, Silver, Bronze)
c. Combined as (High Value, Repeat, New, One-time, Lapsed, &
Lost)
4. Apply K-Mean clustering algorithm on these parameters to group
similar customers
5. Apply classification algorithms such as Logistic Regression and
Decision Trees to predict future customer behaviour.

Task 2: Also try to answer below questions


How is each customer segment different to other?
Which customers are likely to churn?
Which set of customers have potential to spend more?
Which products or services entice which customer group?
How seasonality does affect the customer behaviour?
What is the intersection between the customer loyalties with their
life stages?

RFM Analysis (Balloon Plot)

Task 3: Create a presentation which is DEMO ready with


below information

Problem Objective:
Approach:
o Data Description:
o Algorithm Used:
o Language/Tool Used:
Output:
Challenges:

Reference:
1. https://ds4ci.files.wordpress.com/2013/09/user08_jimp_custseg_revnov08.
pdf
2. http://inseaddataanalytics.github.io/INSEADAnalytics/Report_s45.html
3. http://analyzecore.com/2015/02/16/customer-segmentation-lifecycle-gridswith-r/

Case Study 2: Market Basket


Analysis
Market Basket Analysis (MBA), also known as affinity analysis, is a data mining
technique used to understand the purchase pattern of the shoppers. This
analysis unveils the hidden association between the products or the product
groups. This analysis is highly recommended for understanding cross-selling
opportunities in the associated categories as well as up-selling opportunities
within categories. Advanced Market Basket Analysis provides an excellent way to
get to know the customer and understand the different behaviours.

Task 1: Design a model which can segment customer into


different groups and try to answer below additional
questions

Summary of Purchase behaviour


Develop a cross promotional programs
How this analysis can help redesigning store layout/design
Prepare a discount plans / promotions

Data:
http://archive.ics.uci.edu/ml/datasets/Online+Retail
Grocery.csv

Task 2: Create a presentation which is DEMO ready with


below information
Problem Objective:
Approach:
Data Description:
Algorithm Used:
Language/Tool Used:
Output:
Challenges:
Reference: https://gallery.cortanaintelligence.com/Experiment/Frequentlybought-together-market-basket-analyses-using-ARULES-1

Case Study 3: Customer Churn


Analytics
Today, a retailer needs to keep track of the rate of attrition in their customer
base. Acquiring new customers is far more costly than retaining existing
customer as retaining customers maintains volume and revenues at lower cost,
increases customer lifetime value, and should create loyalty from limited
experience and habit. Customer Churn analysis help organizations identify key
factors driving churn and to design customer churn prevention and win back
strategies to improve customer retention, loyalty and Life time value.

Task 1: Create a Churn Analytics model which predict


the Churn and try to answer below question

What are my key Churn drivers?

How many customers am I going to lose by next month?

How can I develop a customer win back strategy?

Data:
UserInfoData.CSV : User Information Data
UserActivityData : User Activity Data

High Level Flow:

Reference: https://gallery.cortanaanalytics.com/Collection/Retail-Customer-ChurnPrediction-Template-1

Case Study 3: Pricing


Optimization
High competition, consumer price sensitivity and pressure on margins are the
key challenges that a retailer faces every day. Pricing optimization has been
shown time after time to increase sales and margins; it is one of the most direct
routes between analytics and the bottom line. It is designed to enable retailers to
optimize base prices to achieve their sales, volume, profit and price image
objectives for regular, everyday items.
Data: Point-of-sale information for seasonal and non-seasonal data
Create a model which can forecast a set of demand curves for particular SKUs in
particular stores or clusters. The demand curve identifies the products that are
the most and least price-sensitive.

Also try to answer these additional questions

From these predicted


recommended pricing.

How can I price my brands to increase the overall portfolio growth?

What is the impact of competitive price on my brands?

Which products or brands are highly price sensitive?

What are the right price points to maximize the sales?

What is the right mix of Price discount and marketing activity?

demand

curves

how

to

determine

optimal

Case Study 4: Marketing Mix


The old complaint that, Half of the money I spend on advertising is wasted; the
trouble
is
I
dont
know which half no longer has to be true.

Through marketing mix modelling, it is determined where and how much the
retailers
should
spend
their marketing resources. We use econometric models, along with the deep
understanding
of
your
market and industry, to measure the performance of your marketing mix. Our
optimization
techniques are flexible and are specific to the business challenge; hence
maximize
sales
and
produces highest return for your marketing investments.

Key Features:

Measuring Effectiveness Determine which media vehicles are most effective


at
driving
sales, profits, share and consumer segments
Understanding sensitivity Understanding whether the macroeconomic,
demographic,
competitive, seasonal, weather and operational factors impact on sales and
shares
ROI Calculation ROI is calculated for each advertising and promotional
campaign,
including halo effects and cannibalization on related brands
Recommendations The results of our marketing mix modeling tells us whether
a
marketing vehicle needs to continue or discontinued and how to reallocate the
marketing
spends for maximizing marketing ROI.

Case Study 5: Price Prediction


Tubes can vary across a number of dimensions, including base materials, number
of bends, bend radius, bolt patterns, and end types.

Currently, Caterpillar relies on a variety of suppliers to manufacture these tube


assemblies, each having their own unique pricing model. This competition
provides detailed tube, component, and annual volume datasets, and challenges
you to predict the price a supplier will quote for a given tube assembly.

Data:
Sample_submission.csv
Data.csv

The dataset is comprised of a large number of relational tables that describe the
physical properties of tube assemblies. You are challenged to combine the
characteristics of each tube assembly with supplier pricing dynamics in order to
forecast a quote price for each tube. The quote price is labeled as cost in the
data.
High Level Flow:

Reference: https://www.kaggle.com/c/caterpillar-tube-pricing

Вам также может понравиться