Академический Документы
Профессиональный Документы
Культура Документы
Data Tools
• Data Analysis
• Statistical Analysis
• Limitations
Data Analysis
• Introduction
• Types of Analytics
o Descriptive Analytics
o Diagnostic Analytics
o Predictive Analytics
o Prescriptive Analytics
• Areas of Analytics
• Customer Analytics
• Risk Analytics:
• Financial Analytics:
• Performance Analytics:
Introduction
• Exploring data
• Machine learning models and predictive stuff
• Getting the Data
• Manipulate & Clean-up to analyze
• Example:
o Multinational Computer Hardware Production organization has a
requirement to analyze production data & identify the defect patterns across
all its manufacturing plants across the globe.
o Using the proper Data Analysis technique and tool
o Defects can be obtained & analysed for questions
• number of defects for each plant / machine / per day etc
o Optimize their Supply-Chain Process and do a predication of Demand-
Supply Chain
Types of Analytics
• Descriptive Analytics
o What has already happened and it is the simplest form
o Data is sliced into smaller chunks
• Eg:
o Banking Client’s data
• Geographical dispersion of Client’s
• What are different Segments of Client’s (HNI etc.,)
• Clients expenditure’s, income patterns etc.,
• Diagnostic Analytics
o Why something’ has happened from the data gathered
o Digs deeper in to the data to identify the real cause
Types of Analytics - Cont..
• Predictive Analytics
o Predications based on the trends identified in the Data
o Uses different Statistical, Data modeling and Data mining
techniques
o Business Analyst and Data scientist can make forecast for the future
time
o Defining the Business Strategies, Objectives and Processes
• Prescriptive Analytics
o Augment the decision making process of organizations
o Identify the best possible solution among the available solutions
o Considers the business limitations enforced
Types of Analytics - Cont..
• Prescriptive Analytics
• Augment the decision making process of organizations
• Identify the best possible solution among the available solutions
• Considers the business limitations enforced
Area of Analytics
• Customer Analytics
o Organizations gains the insight about customer
o Decisions about the customers offers / discounts etc.,.
• Risk Analytics
o Predict the indecisions of the
o forthcoming business atmosphere
o Evaluating the probability of project’s success
• Financial Analytics(Executives)
o Provides various options to address financial related business queries
o Predict and device financial strategies
• Performance Analytics
o Optimize the day to day operations
o Budgeting for short period, making strategies for meeting the SLA (Service
level Agreement)
Data Analysis using Excel
• Perform basic data computations & also data analysis
• Financial modeling and business planning
• Gold standard for nearly all businesses worldwide
• Wide range of functions, visualization, arrays empowers to quickly
generate insights from data
Why Use Excel
• Most powerful spreadsheet package brought by Microsoft
• A spreadsheet is essentially a matrix of rows and columns
• Extensively used for Data Analysis tool, primarily for following reasons:
o Cost
o Ease of Use
o Learning
Features
• Common features are:
o Workbook Management Cursor
o Management Manipulating Data
o Using Formulae and Functions
o Formatting Spreadsheet
o Printing and Layout
o Creating Charts and Graphs
Sort by color
Reverse list
Randomize list
Player Sort by color Team Span Mat Runs
SK Raina CSK GL 2008 2017 150 4206
Note: this date filter and many other date filters depend on today's date
Date Filter - Sample
Player Country Mat Runs Date
11/11/1985
Yusuf Pathan India 138 2788
12/17/1982
SK Raina India 150 4206
11/27/1986
S Dhawan India 117 3202
12/5/1985
V Sehwag India 104 2728
10/20/1978
V Kohli India 140 4172
11/5/1988
Advanced filter
• Display records that meet complex criteria
• Notice the options to copy your filtered data set to another location and
display unique records only (if your data set contains duplicates)
Advanced Filter - SAmple
Last Name Sales Country Quarter
USA Qtr 4
Last Name Sales Country Quarter
Smith $16,753.00 UK Qtr 3
Johnson $14,808.00 USA Qtr 4
Williams $10,644.00 UK Qtr 2
Jones $1,390.00 USA Qtr 3
Brown $4,865.00 USA Qtr 4
Williams $12,438.00 UK Qtr 1
Johnson $9,339.00 UK Qtr 2
Smith $18,919.00 USA Qtr 3
Jones $9,213.00 USA Qtr 4
Jones $7,433.00 UK Qtr 1
Brown $3,255.00 USA Qtr 2
Williams $14,867.00 USA Qtr 3
Williams $19,302.00 UK Qtr 4
Smith $9,698.00 USA Qtr 1
Data form
• Allows to add, edit and delete records (rows) and display only those records
that meet certain criteria
• Especially in wide rows and want to avoid repeated scrolling to the right
and left
Quick Access Toolbar
• By default, the Quick Access Toolbar, located above the ribbon, contains the
Save, Undo and Redo button
• If you use in Excel command frequently, you can add it to the Quick Access
Toolbar
• You can even add commands to the Quick Access Toolbar that are not in the
ribbon
• Right click anywhere on the ribbon, and then click Customize Quick Access
Toolbar
Quick Access toolbar
Remove duplicate
• Click any single cell inside the data set.
• On the Data tab, in the Data Tools group, click Remove Duplicates
• Excel removes all rows with the same fields (blue) except for the first instances
found (yellow)
Outlining data
• Makes your data easier to view
• Total rows of related data and collapse a group of columns
• On the Data tab, in the Outline group, click Subtotal
• Select the Company column, the column we use to outline our worksheet
• Use the Count function
• Check the Company check box
• Click OK
• To collapse a group of cells, click a minus sign. You can use the numbers
to collapse or expand groups by level. For example, click the 2 to only
show the subtotals
Note: click the 1 to only show the Grand Count, click the 3 to show
everything
Outlining data - sample
Conditional Formatting
Highlight cells with a certain color, depending on the
cell's value
Manage Rules
Data Bars
Color Scales
Icon Sets
New Rule
Find Duplicates
Shade Alternate Rows
Compare Two Lists
Conflicting Rules
Checklist
Manage Rules
• On the Home tab, in the Styles group, click Conditional Formatting
• From the drop-down list, change Current Selection to This Worksheet, to view all
conditional formatting rules in this worksheet
Note: click New Rule, Edit Rule and Delete Rule to create, edit and delete rules
Data Bars
• Visualize values in a range of cells
• Longer bar represents a higher value
• Select a range
• On the Home tab, in the Styles group, click Conditional Formatting
• Click Data Bars and click a subtype
• Change the values
Result: Excel updates the data bars automatically. Read on to further customize these
data bars
• Select the range
• On the Home tab, in the Styles group, click Conditional Formatting, Manage Rules
• Click Edit rule
• Excel launches the Edit Formatting Rule dialog box. Here you can further customize
your data bars (Show Bar Only, Minimum and Maximum, Bar Appearance, Negative
Value and Axis, Bar Direction, etc)
Color Scales
• Shade of the color represents the value in the cell
o Customize your color scale (Format Style, Minimum, Midpoint and Maximum, Color, etc)
Icon Sets
• Each icon represents a range of values
• Customize your icon set (Icon Style, Reverse Icon Order, Show Icon Only, Icon,
Value, Type, etc)
New Rule
• Highlight Cells Rules, Top/Bottom Rules, Data Bars, Color Scales and Icon
Sets are shortcuts. They can also be found under New Rule
• Select 'Use a formula to determine which cells to format'
• Enter the formula
• Select a formatting style and click OK
Find Duplicates
• Click Highlight Cells Rules, Duplicate Values
Note: only use the Stop If True check boxes for backwards compatibility with earlier
versions of Microsoft Excel
Checklist
• Create a checklist in Excel
• First, turn on the Developer tab
• On the Developer tab, in the Controls group, click Insert
Charts
• Say more than a sheet full of umber
o Column Chart o Trendline
o Line Chart o Error Bars
o Pie Chart o Sparklines
o Bar Chart o Combination Chart
o Area Chart o Gauge Chart
o Scatter Chart o Thermometer Chart
o Data Series o Gantt Chart
o Axes o Pareto Chart
o Chart Sheet
Column Chart
• Column charts are used to compare values across categories by using vertical
bars
• On the Insert tab, in the Charts group, click the Column symbol
• Click Clustered Column
Line Chart
• Display trends over time
• Use a line chart if you have text labels, dates or a few numeric labels on the
horizontal axis
• On the Insert tab, in the Charts group, click the Line symbol
• Click Line with Markers
Pie Chart
• Display the contribution of each value (slice) to a total (pie)
• Pie charts always use one data series
• On the Insert tab, in the Charts group, click the Pie symbol
Bar Chart
• Horizontal version of a column chart
• Use a bar chart if you have large text labels
• Click Clustered Bar
Area Chart
• Line chart with the areas below the lines filled with colors
• Use a stacked area chart to display the contribution of each value to a total over
time
• On the Insert tab, in the Charts group, click the Line symbol
• Click Area
• Change the chart's subtype to Stacked Area
(the one next to Area)
Scatter Chart
• Use a scatter chart (XY chart) to show scientific XY data
• Scatter charts are often used to find out if there's a relationship between
variable X and Y
• On the Insert tab, in the Charts group, click the Scatter symbol
• Click Scatter with Straight Lines
Data Series
• A row or column of numbers that are plotted in a chart
• Plot one or more data series in a chart
• On the Insert tab, in the Charts group, click the Column symbol
• Click Clustered Column
Select Data Source
• Launch the Select Data Source dialog box
• Select the chart
• Right click, and then click Select Data
Axes
• Axis Type | Axis Titles | Axis Scale
• Most chart types have two axes
o Horizontal axis (or x-axis)
o Vertical axis (or y-axis)
• Axis Type:
• Shows the dates between 8/24/2018 and 9/1/2018. To remove
these dates, change the axis type from Date axis to Text axis
Chart Sheet
• Move a chart to a separate sheet that only contains a chart (chart sheet)
• On the Design tab, in the Location group, click Move Chart
• Click New sheet and enter a name
Trendline
• Click the + button on the right side of the chart, click the arrow next to Trendline and then click
More Options
• The Format Trendline pane appears.
• Choose a Trend/Regression type. Click Linear
• Specify the number of periods to include in the forecast. Type 3 in the Forward box
• Check "Display Equation on chart" and "Display R-squared value on chart"
Error Bars
• Click the + button on the right side of the chart, click the arrow next to
Error Bars and then click More Options
• The Format Error Bars pane appears
• Choose a Direction. Click Both
• Choose an End Style. Click Cap
• Click Fixed value
Sparklines
• Graphs that fit in one cell and give you information about the data
• On the Insert tab, in the Sparklines group, click Line
• Click in the Data Range box and select the range
Combination Chart
• Combines two or more chart types in a single chart
• On the Insert tab, in the Charts group, click the Combo symbol
• Click Create Custom Combo Chart
Gauge Chart
• Combines a Doughnut chart and a Pie chart in a single chart
Thermometer Chart
• Shows you how much of a goal has been achieved
Gantt Chart
• Customizing the stacked bar chart type
Pareto Chart
• Principle states that, for many events, roughly 80% of the effects come from 20% of
the causes
• On the Insert tab, in the Charts group, click the Histogram symbol
• Click Pareto
Pivot Tables
• One of Excel's most powerful features. A pivot table allows you to
extract the significance from a large, detailed data set Extract the
significance from a large, detailed data set
• Calculate margins and other common ratios using calculation
• Filter data using slicers in multiple pivot tables
• Create aggregate reports using formula based techniques
• Create flexible data aggregations & Represent data visually
Group Pivot Table Items
Multi-level Pivot Table
Frequency Distribution
Pivot Chart
Slicers
Update Pivot Table
Calculated Field/Item
Group Pivot Table Items
Multi-level Pivot Table
• Multiple Row Fields
• Multiple Value Fields
• Multiple Report Filter Fields
Frequency Distribution
• Use pivot tables to easily create a frequency distribution in Excel
• Can also use the Analysis Toolpak to create a histogram
Pivot Chart
• Visual representation of a pivot table in Excel
• Pivot charts and pivot tables are connected with each other
• Click any cell inside the pivot table
• On the Analyze tab, in the Tools group, click PivotChart
Slicers
• Can insert slicers in Excel to quickly and easily filter pivot tables
• Click any cell inside the pivot table
• On the Analyze tab, in the Filter group, click Insert Slicer
Update Pivot Table
• Any changes you make to the data set are not automatically picked up by the pivot table
• Refresh the pivot table or change the data source to update the pivot table with the applied changes
Refresh
• Click any cell inside the pivot table
• Right click and click on Refresh
Change Data Source
• Click any cell inside the pivot table.
• On the Analyze tab, in the Data group, click Change Data Source
Calculated Field/Item
• Insert a calculated field or calculated item in a pivot table
Calculated Field
• A calculated field uses the values from another field. To insert a calculated field, execute the following
steps.
• Click any cell inside the pivot table.
• On the Analyze tab, in the Calculations group, click Fields, Items & Sets
Calculated Item
• A calculated item uses the values from other items. To insert a calculated item, execute the following
steps.
• Click any Country in the pivot table.
• On the Analyze tab, in the Calculations group, click Fields, Items & Sets
Tables
Analyze your data in Excel quickly and easily
• Structured References
o Make your formulas easier to understand
• Table Styles
o Create your own table style
o Quickly format a cell by choosing a cell style
What-if Analysis
Allows you to try out different values (scenarios) for
formulas
• Data Tables
• Goal Seek
• Quadratic Equation
• Scenario Manager
Data Tables
• One Variable Data Table
• Two Variable Data Table
Instead of creating different scenarios, create a data table to quickly try out different values
for formulas
On the Data tab, in the Forecast group, click What-If Analysis
Quadratic Equation
• Form ax2 + bx + c = 0 where a ≠ 0
• A quadratic equation can be solved by using the quadratic formula
• Can also use Excel's Goal Seek feature to solve a quadratic equation
Solver
Uses techniques from the operations research to find optimal
o Descriptive Statistics
o Anova
o F-Test
o t-Test
o Moving Average
o Exponential Smoothing
o Correlation
o Regression
Histogram
• How to create a histogram in Excel
• First, enter the bin numbers (upper levels) in the range
• On the Data tab, in the Analysis group, click Data Analysis
• Select Histogram and click OK
Descriptive Statistics
• On the Data tab, in the Analysis group, click Data Analysis
• Select Descriptive Statistics and click OK
• Select the range A2:A15 as the Input Range
• Select cell C1 as the Output Range
• Make sure Summary statistics is checked
Anova
• Perform a single factor ANOVA (analysis of variance) in Excel
• A single factor or one-way ANOVA is used to test the null hypothesis that the
means of several populations are all equal
F - Test
• Used to test the null hypothesis that the variances of two populations are
equal
T - Test
• Used to test the null hypothesis that the means of two populations are equal
Moving Average
• Used to smooth out irregularities (peaks and valleys) to easily recognize
trends
Exponential smoothing
• Used to smooth out irregularities (peaks and valleys) to easily recognize
trends
Correlation
• Corelation coefficient (a value between -1 and +1) tells you how strongly two
variables are related to each other
• We can use the CORREL function or the Analysis Toolpak add-in in Excel to find the
correlation coefficient between two variables
• A correlation coefficient of +1 indicates a perfect positive correlation
• As variable X increases, variable Y increases. As variable X decreases, variable Y
decreases
Regression
• How to interpret the Summary Output & Significance F and P-values
Formatting
o This function looks for specified value in an array of cells and then returns the
relative location of that element.
Essential keyboard shortcuts
• Ctrl +[Down|Up Arrow]: Moves to the top or bottom cell of the current column
and combination of Ctrl with Left|Right Arrow key, moves to the cell furthest left
or right in the current row
• Ctrl + Shift + Down/Up Arrow: Selects all the cells above or below the current
cell
• Ctrl+ Home: Navigates to cell A1
• Ctrl+End: Navigates to the last cell that contains data
• Alt+F1: Creates a chart based on selected data set.
• Ctrl+Shift+L: Activate auto filter to data table
• Alt+Down Arrow: To open the drop down menu of autofilter. To use this shortcut:
• Alt+D+S: To sort the data set
• Ctrl+O: Open a new workbook
• Ctrl+N: Create a new workbook
• F4: Select the range and press F4 key, it will change the reference to absolute, mixed
and relative
Limitations