Module / Topic Hours

Module 1: Data Analytics and R


Unit 1 Introduction to Analytics and R programming
Introduction to R, RStudio (GUI): R Windows Environment

R as a Calculator and introduction to various data types:
numeric, character, date, data frame, array, matrix, etc.
Reading Datasets: reading, importing, exporting, and writing to CSV and text files
Reading Data from Different File Types: .txt, .csv, Big Data tools like Impala, etc. 3

Creating new variables and modifying existing variables
Imputing Missing Data using standard methods and algorithmic approaches (mice package in R)
Outliers
Combining Datasets: Appending and Concatenating.
rbind, cbind, plyr package in R, etc.
R functions and loops, for, ifelse etc
Unit 2 Summarizing Data and Revisiting Probability
Summary Statistics – Summarizing data with R
Probability
Expected Value
Random Variables
Bivariate Random Variables
Probability distribution
Distributions – Normal, Logistic, Uniform, Binomial,
Poisson etc.
Importance of Normal
Tests of Normal Distribution – Chi-squared, Student’s t, F-test
Central Limit Theorem
Random Walk
Application of Random Walk 3
Unit 3 Introduction to Time series Analysis
Basic Regression Analysis
Assumptions of OLS Regression
Regression Residuals
Regression Modeling
Correlation, ANOVA
Forecasting Using PROC REG
Heteroscedasticity – meaning, tests and fixes
3
Autocorrelation – What? Tests. Fixes.
Multicollinearity – What? Tests. Fixes.
Introduction to Multiple Regression
Dummy Variables
Unit 3 SQL using R
NoSQL
Excel and R integration with R connector in Excel through
VBA
Unit 4 Understand the Verticals - Engineering, Financial and others

Understand systems, viz. Engineering Design, Manufacturing, Smart Utilities, Production lines, Automotive industry, Tech systems. Specifics: engine-related variables, etc. 3

Understand the business problems related to Engineering, identify the critical issues, and set business objectives
Unit 5 Requirements Gathering
1.5
Gather all the data related to Business objective
Unit 6 Data Management
Design Data Architecture and manage the data for
analysis

Understand various sources of the data: sensor data, signal processing, GPS data, repair history (in the case of engines, machinery parts, etc.), part failures, scheduled and unscheduled maintenance, etc. 4.5
Export all the data onto the Cloud like Amazon Web
Services S3
Unit 7 Big Data Tools
Introduction to the Big Data tools like Spark, Scala, Impala
etc for the streaming data handling
Data ETL process. Use the Big Data tools like Spark, Scala, 5
Impala etc for the streaming data handling
Identify gaps in the data and follow-up for decision
making
Unit 8 Big Data Analytics
Run Descriptives to understand the nature of the available
data
Collate all the data sources to suffice Business
requirement 7.5
Run the descriptive statistics for all the variables and
observe the data ranges
Outlier detection and elimination
Data Preprocessing for the analysis
Unit 9 Machine Learning Algorithms
Hypothesis testing and determining the multiple analytical
methodologies
4.5
Train Model on 2/3 sample data using various
Statistical/Machine learning algorithms, Test model on 1/3
sample for prediction
Explore the chosen algorithms for more accuracy
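
The 2/3 training and 1/3 test split named in this unit can be illustrated with a minimal sketch. It uses Python's standard library rather than R, and the function name `split_two_thirds` and the fixed seed are assumptions made for illustration only, not part of the course material.

```python
import random


def split_two_thirds(rows, seed=0):
    """Shuffle the rows and return (train, test) with about 2/3 in train."""
    shuffled = list(rows)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    cut = (2 * len(shuffled)) // 3
    return shuffled[:cut], shuffled[cut:]


# Hypothetical data set of 30 observations, identified by row number.
train, test = split_two_thirds(range(30))
print(len(train), len(test))  # 20 10
```

A model would then be fitted on `train` only and its predictions checked against `test`, so that accuracy is measured on data the model has not seen.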
Unit 10 Data Visualization
3
Prepare the data for visualization, use Tableau tools, etc., and draw insights out of the visualization tool


Unit 11 Product Implementation
2
Integrate the entire process into a Product
Total Hrs 40
Module 2: Predictive Analytics
Unit 1 Introduction
What and Why Analytics
Meaning and Purpose
Reporting vs. Analytics

How to bring data together from across the business

Skills needed for the new age of analytics


Impact of recession on business analytics
Introduction to Tools and Environment
Application of Analytics in various verticals
Introduction to Engineering and Financial Services
domain
Difference between tools and techniques
Techniques using various statistical tools
Application of Modeling in Business
What is modeling
Signal processing (Sensor data in Manufacturing) and
applications
Application of Modeling in Manufacturing domain 3
Results of modeling
Databases + Types of Data & Variables
Concept of Data Dictionary (participants need to
create one)
Introduction to Datatypes, and its description etc
Introduction to types of Variables in the
Manufacturing Domain
Data Modeling Techniques Overview
Introduction to Linear, OLS and Non Linear (ex:
Logistic) Regression
Cluster Analysis
Introduction to Time Series Modeling
Introduction to Factor and PCA regression
Missing Imputations
Introduction to imputation techniques (mice in R,
mean, median, mode)
A simple case study wherein the values of the
variables are missing and need imputation
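
The simple imputation techniques listed above (mean, median, mode) can be sketched as follows. This is an illustrative Python example using only the standard library. It does not reproduce the algorithmic approach of the mice package in R, and the function name `impute` and the sample readings are assumptions.

```python
from statistics import mean, median, mode


def impute(values, method="mean"):
    """Replace None entries with the mean, median, or mode of the observed values."""
    observed = [v for v in values if v is not None]
    fill = {"mean": mean, "median": median, "mode": mode}[method](observed)
    return [fill if v is None else v for v in values]


# Hypothetical sensor readings with two missing values.
readings = [2, None, 4, 4, None, 10]
print(impute(readings, "mean"))
print(impute(readings, "median"))
print(impute(readings, "mode"))
```

Here the mean of the observed values is 5 while the median and mode are both 4, which shows how the choice of method changes the filled-in values when the data are skewed.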
Unit 2 Linear Regression
Need for Business Modeling
Regression means going to the past. So what is it
doing in the business?
Examples of different verticals requiring prediction

How does one determine factors affecting a business
Difference between BI and Predictive modeling
Relating the data available with the business
problem
How to effectively determine a business problem
and relate it to Analytics?
Regression - Concepts
The Regression problem
Finding the best-fit straight line - OLS Method
Sum of Square Errors
Base Line model
Coefficient of Determination
BLUE Property, Assumptions - Least Squares estimation
Normality of residuals
Homoscedasticity (White Test)
Autocorrelation of residuals (Durbin-Watson Statistic) 7
Correlation between Independent variable and
residual etc.
Implications of Violation of BLUE properties
Overview of different methods to alleviate
heteroscedasticity
Box Cox Transformation
Residual Analysis, Its importance and Implications
Cook's D statistic, standardised residual values, etc.
Multiple regression and Collinearity
Tolerance
VIF
Variable Rationalization (based on distributions,
multicollinearity etc.)

Incorporating Categorical Data - Dummy variable coding and interpretation of dummy variable coefficients
Model Building
Variable selection
Interpretation of the model
Prediction and scoring new data
R-Square
Adjusted R-Square
Performance tracking through Validation samples
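
Several regression concepts from this unit (the OLS best-fit line, the sum of squared errors, the baseline model, and the coefficient of determination) fit into one short sketch. It is an illustrative Python example for a single predictor. The function names and the sample data are assumptions, and a real analysis would normally rely on a statistics package.

```python
def ols_line(x, y):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def r_squared(x, y, slope, intercept):
    """Coefficient of determination: 1 - SSE(model) / SSE(baseline)."""
    mean_y = sum(y) / len(y)
    # Sum of squared errors of the fitted line.
    sse = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    # The baseline model predicts the mean of y for every observation.
    sst = sum((yi - mean_y) ** 2 for yi in y)
    return 1 - sse / sst


# Hypothetical data: three observations of one predictor and one response.
x, y = [1, 2, 3], [1, 2, 2]
slope, intercept = ols_line(x, y)
print(slope, intercept, r_squared(x, y, slope, intercept))
```

For this data the fitted slope is 0.5 and R-squared is about 0.75, meaning the line removes three quarters of the squared error left by the baseline model.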
Unit 3 Logistic Regression
Model Theory
The Classification problem
The joint probability prediction
Sigmoidal function
The logistic transformation

Likelihood function and Maximum Likelihood estimation
Difference between OLS & MLE
Variable selection
Interpretation of the model through maximum
likelihood estimates
Interpretation of the model through odds ratio
estimates
6
Variable Transformations / Derived Variables - in various
Domains (manufacturing, financial services etc.)
Introduction to Data cleaning including character to
numeric and vice versa
Introduction to classing
Deriving new variables from existing variables
Model fit Statistics
Concordance and Somers' D
Hosmer-Lemeshow Test
Error/Confusion matrix
Receiver operating characteristic (ROC)
Model Conclusion - Tableau Visualization: various charts (pie charts, line graphs, scatter graphs, bar charts, column graphs, ring plots, etc.)
Analytics application to various Domains 12
Planning and Estimation (e.g. engine health estimation,
consumer credit products)
Drivers of Asset/Engine risk categories analytics
Different approaches to Asset scoring models
Key factors determining the Asset scoring
Validation and maintenance of Asset scoring models
Unit 4 Objective Segmentation 4.5
Regression vs. Segmentation - Supervised and
unsupervised learning
CHAID and CART
Nodes and node splitting - impurity measures - Gini Index and Entropy, information gain, decision tree algorithms (CHAID and CART)
Tree Building - regression, classification, overfitting,
pruning and complexity
Multiple decision trees (comparison, performance and
tracking)
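
The impurity measures named in this unit (Gini index, entropy, and information gain) can be made concrete with a short sketch. It is an illustrative Python example, not the CHAID or CART implementation used in the course. The function names and the class labels are assumptions.

```python
from collections import Counter
from math import log2


def gini(labels):
    """Gini index of a node: 1 minus the sum of squared class proportions."""
    n = len(labels)
    return 1 - sum((c / n) ** 2 for c in Counter(labels).values())


def entropy(labels):
    """Shannon entropy of a node, in bits."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def information_gain(parent, children):
    """Entropy of the parent minus the size-weighted entropy of its children."""
    n = len(parent)
    weighted = sum(len(child) / n * entropy(child) for child in children)
    return entropy(parent) - weighted


# Hypothetical node with two classes, split perfectly into two pure children.
parent = ["fail", "fail", "ok", "ok"]
print(gini(parent), entropy(parent))
print(information_gain(parent, [["fail", "fail"], ["ok", "ok"]]))
```

A tree-building algorithm evaluates candidate splits this way and prefers the split that reduces impurity the most.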
Unit 5 Time Series Methods/Forecasting, Feature Extraction 4.5
Univariate Stationary Processes (ARMA) and Forecasts
Univariate Non-Stationary, Integrated Processes (ARIMA) and Forecasts
Measures of Forecast Accuracy
STL approach
Extract features from the generated model, such as Height, Average, Energy, etc., and analyze them for prediction

Project 3
Project Understanding
Understanding the current Engineering /
Manufacturing / Asset system
Creating the Business Understanding Document
Understanding Data and creation of Data Dictionary
Preparing the Data
Analysis and Modeling - which will include the
Classing Report, Variable Reduction report, Model
statistics
Final Recommendations
Project Documentation
Evaluation
Total Hrs 40
Module 3: Working with Documents
Standard operating procedures and guidelines for
documentation and knowledge sharing 1
Defining purpose and scope of documentation 1

Structure of documents - case studies, articles, white papers, technical reports, client reports, minutes of meetings, policies, procedures, work instructions, etc. 10

Publishing and printing process - online and print medium 2


Sources of information and reference material - online and
print 2
Style and format - House style, templates 4

Editorial Review - Review tools and types of errors (deviations, factual inaccuracies, linguistic mistakes, discrepancies, errors, ambiguities, formatting) 6
Intellectual property and Copyright - Types of IP, Protection, Sharing Permissions, open source / GNU licensing 3
Document preparation tools - Visio, PowerPoint, MS Word, Excel, PDF 5
Change management - version control, baseline, approval
process 2
Updating and accessing corporate knowledge base 2
Peer review and feedback 2
Total Hrs 40

Total Hours 120 Hours


Key Learning Outcomes

1. establish clearly the objectives
and scope of the analysis
2. obtain guidance from appropriate
people to identify suitable data
sources to agree the methodological
approach
3. obtain and structure data using
standard templates and tools
4. validate data accurately and
identify anomalies
5. obtain guidance from appropriate
people on how to handle anomalies
in data
6. carry out rule-based analysis of
the data in line with the analysis
plan
7. validate the results of your
analysis according to statistical
guidelines
8. review the results of your analysis
with appropriate people
9. undertake modifications to your
analysis based on inputs from
appropriate people
10. draw justifiable inferences from
your analysis
11. present the results and
inferences from your analysis using
standard templates and tools
12. comply with your organization’s
policies, procedures and guidelines
when carrying out rule-based
quantitative analysis
The outcomes of the module 'Data
Analytics and R' will be enhanced to
predictive analysis, with special focus
on the following outcomes:
1. how to apply rule-based statistical
analysis approaches
2. how to obtain, analyze and use feedback
to improve the data
3. how to make inferences from
statistical analysis
4. types of predictions/conclusions
that can be made based on
statistical data
5. different graphical formats for
presenting data and how to create
these
6. different ways of presenting
information
7. the importance of providing
rationale for inferences
1. establish with appropriate people
the purpose, scope, formats and
target audience for the documents
2. access existing documents,
language standards, templates and
documentation tools from your
organization’s knowledge base
3. liaise with appropriate people to
obtain and verify the information
required for the documents
4. confirm the content and structure
of the documents with appropriate
people
5. create documents using standard
templates and agreed language
standards
6. review documents
with appropriate people and
incorporate their inputs
7. submit documents for approval
by appropriate people
8. publish documents in agreed
formats
9. update your organization’s
knowledge base with the
documents
10. comply with your organization’s
policies, procedures and guidelines
when creating documents for
knowledge sharing
Duration Broad Topics Detailed Topics
4 Introduction Setting class expectations, Stage
introductions
Communication Process
Components of Effective Communication -
4 Communication Conviction, confidence & enthusiasm &
listening
Barriers to Communication
Facial Expressions,
Posture, Gestures,
4 Body Language Eye contact,
Appearance (Dress Code)
Feedback on Day1 stage introductions
Parts of Speech,
Word order,
4 Grammar
Drafting of email
(classroom exercises)
Preparing & delivery of effective
presentations.
Power point presentations
6 Presentation Skills
Excel worksheet
(Classroom activity)
Importance of punctuality in corporates,
4 Time Management Meeting deadlines,
(playing videos)
Working in teams: Concept of Groups,
Group dynamics, classroom activity, Ability
6 Team
to work together.
(playing videos)
Importance of inter-personal skills,
5 Interpersonal Skills positive attitude, empathy and
(Role plays)
What is stress,
(videos to explain the concept),

5 Stress Management
Practicing different methods of relieving
stress

Establish and agree their work
requirements with appropriate people:
• Keep their immediate work area clean
and tidy
• utilize their time effectively
• Use resources correctly and efficiently
• Treat confidential information correctly
6 Managing Self and Work
• Work in line with organization’s policies
and procedures
• Work within the limits of their job role
• Obtain guidance from appropriate
people, where necessary
• Ensure their work meets the agreed
requirements
• Communicate with colleagues clearly,
concisely and accurately
• Work with colleagues to integrate their
work effectively with them
• Pass on essential information to
colleagues in line
with organizational requirements
• Work in ways that show respect for
colleagues
• Carry out commitments they have made
to colleagues
6 Teamwork and Communication
• Let colleagues know in good time if they
cannot carry out their commitments,
explaining the reasons
• Identify any problems they have working
with colleagues and take the initiative to
solve these problems
• Follow the organization’s policies and
procedures for working with colleagues
• Comply with their organization’s current
health, safety and security policies and
procedures
• Report any identified breaches in health,
safety, and security policies and
procedures to the designated person
• Identify and correct any hazards that
they can deal with safely, competently
and within the limits of their authority
4 Work, Health and Safety
• Report any hazards that they are not
competent to deal with to the relevant
person in line with organizational
procedures and warn other people who
may be affected
• Follow their organization’s emergency
procedures promptly, calmly, and
efficiently
• Identify and recommend opportunities
for improving health, safety, and security to
the designated person
• Establish and agree with appropriate
people the data/information they need to
provide, the formats in which they need to
provide it, and when they need to provide
it
• Obtain the data/information from reliable
sources
• Check that the data/information is
accurate, complete and up-to-date
• Obtain advice or guidance from
appropriate people where there are
problems with the data/information
• Carry out rule-based analysis of the
data/information, if required
4 Data Information and Management
• Insert the data/information into the
agreed formats
• Check the accuracy of their work,
involving colleagues where required
• Report any unresolved anomalies in the
data/information to appropriate people
• Provide complete, accurate and up-to-
date data/information to the appropriate
people in the required formats on time
Candidates will be able to:
• Obtain advice and guidance from
appropriate people to develop their
knowledge, skills and competence

• Identify accurately the knowledge and
skills they need for their job role
• Identify accurately their current level of
knowledge, skills and competence and
any learning and development needs
• Agree with appropriate people a plan of
learning and development activities to
address their learning needs
4 Learning and Self Development
• Undertake learning and development
activities in line with their plan
• Apply their new knowledge and skills in
the workplace, under supervision
• Obtain feedback from appropriate people
on their knowledge and skills and how
effectively they apply them
• Review their knowledge, skills and
competence regularly and take appropriate
action

Creating effective resume,


Telephonic interviews,
4
Face 2 face,
Session on Interviewing Skills and Mock JAM
practice (Mock interviews),
4
Session on Interviewing Skills and Mock
practice
Body language of Interviewer and
4
interviewee, Group discussion etc.

6 CONCLUSION Review of what has been learnt and Q&A.

80 Hours Total Hours


Learning Outcome
Baseline Assessment

Students will learn the importance of
communication in getting their dream job

Students will learn the finer nuances of
body language and how to project
themselves during interviews and in
corporate life

Corporate expectations on drafting an
email, including the dos and don'ts, will be
explained to help them express
themselves effectively

The students will learn how to give
corporate-style presentations while
exuding confidence and communicating
effectively
By the end of the course, the students
will value time and know how to excel
by managing time effectively in
a corporate environment
Only a team player wins in corporate life;
the importance of teamwork will be explained
via team-building activities
Students will learn how their attitude
defines their altitude and how to be
thoroughbred professionals even before they step
into corporate life

Students will learn that in corporate life
rewards come with responsibilities,
sometimes at the cost of one's health. A
balanced lifestyle will be explained to
help them sustain a long career

Students will learn corporate
expectations on office hygiene

Students will learn the value of
commitment in the workplace and how
policies and procedures help
them deliver within the corporate
horizon

Students will learn the pros and cons of
confidentiality, the value of integrity,
and other corporate values in tandem
with the vision and mission of a company

Students will be exposed to various
corporate-level trainings to enhance
their existing skill set and support their
growth within a company

Students will learn techniques for
acing an interview, as we do not want
them to simply crack it but to pursue their
dreams

By the end of this topic, students will be
more confident than before and able to take on any
interview in their stride

Assessment
