Вы находитесь на странице: 1из 5

Data Science

Data Mining

Business Intelligence

Data Visualization and


Summarization

Data Accessing

Chapter-1

Part-1 Descriptive Statistics:

Introduction to Advanced Data Analytics


Statistical inferences Types of Variables
Measures of central tendency
Dispersion
Variable Distributions
Probability
Distributions
Normal Distribution and Properties

Part-2: Data quality outlier

Robust measurements
Outlier treatment with central tendency
Replacing with series means or median values
Z score Calculation
Data Normalization
Sampling and estimation

Null/Alternative Hypothesis formulation


Type I and Type II errors
One Sample TTEST
Paired TTEST
Independent Sample TTEST
ANOVA,
MANOVA
Chi Square Test

Overview of 9 Business Intelligence Platform and Data


Integration Study

Chapter-2

Working Under Change Management


What is change management?
Repository types
Using change management

Data Management
Chapter-3

Part-3: Test of Hypothesis

Data Import and Export


Save and Load R Data
Import from and Export to CSV
Files
Import Data from SAS
Import/Export via ODBC
Read from Databases

Data Exploration
Explore Individual Variables
Explore Multiple Variables
More Explorations
Save Charts into Files
Decision Trees and Random
Forest
Decision Trees with Package
party
Decision Trees with Package rpart

Predictive Modeling

Linear Regression
Logistic Regression
Generalized Linear Regression
Non-linear Regression
J48
Decision Stump
The k-Means Clustering
Hierarchical Clustering

Designing the Data Mart


Planning a data warehouse
Course data for the Orion Star company
Orion Star data models

Chapter-4

Building a Data Mart


Review of the case study
Defining the source data
Defining the target tables
Loading the target tables

Chapter-5

Administering Data Integration Studio


Setup tasks for Data Integration Studio
Setting up change management
Security planning and implementation (self-study)

Kruskal-Wallis,Mann-Whitney,
Wilcoxon,
McNemar test

Data Preparation and Quality Check


Part-4: Data Validation & Imputation

Univariate procedure
Q-Q probability plots
Cumulative frequency (P P) plots
Explorer analysis
Steam and leaf analysis
Kolmogorov Smirnov test
Shapiro Wilks test

Part-5: Data Transformation

Log transformation (s)


Arcsine transformation
Box- Cox transformation
Square root transformation
Log transformation (s)
Inverse transformation
Min- Max Normalization

Predictive Analytics
Part-6: Predictive modeling & Diagnostics

Correlation Pearson, Kendall


SLR Regression
MLR Regression
Residual analysis
Auto Correlation
VIF Analysis

Density-based Clustering
Association Rules
Basics of Association Rules
Visualizing Association Rules
Discussions and Further Readings
Transforming Text
Stemming Words
Building a Term-Document Matrix
Frequent Terms and Associations
Word Cloud
Network of Terms
Data partition (Training, Validating
Testing)
Data Explore
Data Testing
Data Transform
SVM Model
Tree Analysis

Chapter-6

Chapter-7

Model Evaluation

ROC
Lift Curve
Sensitivity
Confusion matrices
Precession
Sensitivity
Prediction Score
Cross Validation
MAPE
MAE
AIC
BIC
Residual

Introduction to Online Analytical Processing


Overview of OLAP Cube Studio
Registering metadata

Working with Transformations


Working with the extract transformation
Working with the data validation transformation
Working with the apply lookup standardization
transformation
Working with the sort transformation
Working with the append transformation
Working with an analysis transformation
Working with the transpose transformation
Working with the transformation generator wizard

Chapter-8

Working with OLAP Cubes Derived from Star


Schemas
Creating a OLAP cube from a star schema
Viewing a OLAP cube with Web OLAP Viewer for
JAVA

Chapter-9

Monitoring and Tuning a OLAP Cube


Using Application Response Measurement (ARM) for
tuning
Building an information map from a OLAP cube
Creating a report with Web Report Studio

Indexing Eigen Value interpretation


Homoscedasticity
Homogeneity
Stepwise regression
Transformation of variables

Chapter-10

Part-7 Logistic Regression Analysis

Discriminant and Logit Analysis


Multiple Discriminant Analysis
Stepwise Discriminant Analysis Binary
Logit Regression
Estimation of probability using logistic regression,
Wald Test
Hosmer Lemshow

Advanced Analysis
Part-8: Factor Analysis

Introduction to Factor Analysis PCA


Reliability Test
KMO MSA tests, Eigen Value Interpretation
Rotation and Extraction
Varimix Models
Principle component analysis
Conformity Factor Analysis
Exploitary Factor Analysis

Part-9: Cluster Analysis

Introduction to Cluster Techniques


Distance Methodologies,
Hierarchical and Non-Hierarchical Procedures K
Means clustering

Overview of the Add-In for Microsoft Office


Introduction to the course data
Introduction to the course scenarios

Chapter-11

Exploring the Add-In for Microsoft Office


Using the Add-In for Microsoft Excel
Using the Add-In for Microsoft Word and Microsoft
PowerPoint
Using the Add-In to publish documents (self-study)

Chapter-12

Exploring the Add-In for Microsoft Office


Using the Add-In for Microsoft Excel
Using the Add-In for Microsoft Word and Microsoft
PowerPoint
Using the Add-In to publish documents (self-study)

Chapter-13

Analyzing Data with Tasks in Microsoft Office


Overview of tasks
List data task
One-way frequencies task
Table analysis task
Summary table task
Bar chart task

Chapter-14

Using the Add-In for Microsoft Office


Overview of the Add-In for Microsoft Office

Wards Method

Using the Add-In for Microsoft Office


Solutions to Exercises

Part- 10: Conjoint Analysis


Chapter-15

Statistics and terms Association with Conjoint


Analysis
Assumption and limitation of conjoint analysis
Hybrid Conjoint Analysis

Part 11: Time Series Forecasting

Smoothing and annual Time series


Time series forecasting for seasonal data
Multiplicative Models
Additive Models

Data Mining for Business Intelligence


Part -12: Data Mining

Data partition (Training, Validating Testing)


Data Explore
Data Testing
Data Transform
Linear Model
SVM Model
Tree Analysis
RandomForest Analysis
Model Evaluation
ROC
Lift Curve
Sensitivity
Error/ Confusion matrices

Part -13: Business Intelligence

Working with Stored Processes


Overview of Stored Processes
Creating and Registering a Stored Process
Manually Creating and Registering a Stored Process
Solutions to Exercises
For Your Information v

Chapter-16

Using Enterprise Guide


Investigating the Features of Enterprise Guide
Exploring Enterprise Guide
Using the Create New Stored Process Wizard
Working with OLAP Cubes in Enterprise Guide

Chapter-17

Using Information Map Studio


Overview of Information Map Studio
Using Information Map Studio
Solutions to Exercises

Chapter-18

Using Web Report Studio


Overview of Web Report Studio
Using Web Report Studio
Solutions to Exercises

Chapter-19

Data Warehousing for Data Modeling


Data Warehousing for Report Building
Stars Schemes for Data Marts
Multi dimensional summarization (OLAP)
Web analytics (Concepts)

Big Data analysis


Part -14 Hadoop

Introduction to big data


Sources of big data
Hadoop distributed file system
Employing Hadoop MapReduce
Statistical Analysis of Big Data

Using the Information Delivery Portal


Overview of the Information Delivery Portal
Using the Information Delivery Portal

Chapter-20

Dashboards
Overview of dashboard
Creating Data model
Creating editing indicators
Creating editing gauges of type of gauges

Вам также может понравиться