Академический Документы
Профессиональный Документы
Культура Документы
Resources on Blackboard
• Excel tutorials
• StatTools tutorials
Describing Data
• What kind of Data do you encounter in your daily business?
2-2 Basic Concepts
Types of data
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
2-2a Populations and Samples
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
2-2b Data Sets, Variables, and Observations
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.1: Data from an Environmental Survey
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
2-2c Types of Data
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Types of Data
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Types of Data
Categorical _ arithmetic operations do NOT make sense.
• Nominal _ Measurement are categories & numerical values have no
mathematical significance. Example: color.
• Ordinal _ Rank order. Example: horse racing
Numerical _ arithmetic operations make sense.
• Discrete_ Measurement are from counts. Example: size of household.
• Continuous _ continuous measures. Example: bank saving, volume
Test Yourself
Determine Types of Data.
Challenge Level 1:
• political party affiliation
• grades (A, A-, B+… F)
• test scores (SAT, ACT)
• size of household
Test Yourself
Determine Types of Data.
Challenge Level 2:
• Are you a smartphone owner? (1 = Y; 0 = N)
• President Trump
• Q1: Dependable _ _ _ _ _ _ _ _ _ Undependable
• Q2: Trustworthy _ _ _ _ _ _ _ _ _ Untrustworthy
• BZAN6310 is known to be extremely difficult.
1=Strongly Disagree 2=Disagree 3=Neutral 4=Agree 5=Strongly Agree
Types of Data
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Test Yourself
Determine Types of Data.
Challenge Level 3:
• Age
• What is your age? ___________
• What is your age? ___________
• 1 = under 18
• 2 = 18 – 25
• 3 = 26 – 55
• 4 = above 55
• What is your age? ___________
• Young
• Old
Example 2.1: Data from an Environmental Survey
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Types of Data
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Types of Data
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Types of Data
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
2-3 Descriptive Measures for
Categorical Variables
Count the number of observations in each category. (The resulting counts can
be reported as “raw counts” or as percentages of totals.)
Once you have the counts, you can display them graphically, usually in a column
chart or a pie chart.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.2: Supermarket Sales
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.2: Supermarket Sales
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.2: Supermarket Sales
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
2-4a Numerical Summary Measures
Measures of Variability
Measures of Shape
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Measures of Central Tendency
(slide 1 of 3)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Measures of Central Tendency
(slide 2 of 3)
The median is the middle observation when the data are sorted from
smallest to largest.
If the number of observations is odd, the median is literally the middle
observation.
If the number of observations is even, the median is usually defined as the
average of the two middle observations.
In Excel®, the median can be calculated with the MEDIAN function.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Measures of Central Tendency
(slide 3 of 3)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.3: Baseball Salaries
(slide 2 of 10)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Minimum, Maximum, Percentiles, and Quartiles
For any percentage p, the pth percentile is the value such that a
percentage p of all values are less than it.
The quartiles divide the data into four groups, each with
(approximately) a quarter of all observations.
The first, second, and third quartiles are the percentiles corresponding to p
= 25%, p = 50%, and p = 75%.
By definition, the second quartile (p = 50%) is equal to the median.
The minimum and maximum values can be calculated with MIN and
MAX functions, and the percentiles and quartiles with PERCENTILE and
QUARTILE functions in Excel®.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Measures of Variability
(slide 1 of 4)
If all observations are close to the mean, their squared deviations from the
mean—and the variance—will be relatively small.
If at least a few of the observations are far from the mean, their squared
deviations from the mean—and the variance—will be large.
In Excel®, use the VAR.S function to obtain the sample variance and the
VAR.P function to obtain the population variance.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Measures of Variability
(slide 3 of 4)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Empirical Rules for Interpreting Standard Deviation
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Empirical Rules for Interpreting Standard Deviation
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Empirical Rules for Interpreting Standard Deviation
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Measures of Shape
(slide 1 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.3: Baseball Salaries
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Histogram Example: Late or Lost Baggage
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
2-4b Numerical Summary Measures with StatTools
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Basic StatTools Features
(slide 3 of 5)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Histograms
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.3: Baseball Salaries
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.3: Baseball Salaries
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.3: Baseball Salaries
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.3: Baseball Salaries
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Histogram Example: Late or Lost Baggage
This example file contains the count of bags late or lost for 456 flights.
The “natural” bins are the integer values 0 to 8, which matches to
values for the counts.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Histogram Example: Late or Lost Baggage
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Box Plots
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.3: Baseball Salaries
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.3: Baseball Salaries
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.3: Baseball Salaries
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.3: Baseball Salaries
Our main interest in time series variables is how they change over time,
and this information is lost in traditional summary measures and in
histograms or box plots.
For time series data, a time series graph is used. This is a graph of
the values of one or more time series, using time on the horizontal axis.
This is always the place to start a time series analysis.
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.4: Crime in United States
(slide 1 of 6)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.4: Crime in United States
(slide 2 of 6)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.4: Crime in United States
(slide 3 of 6)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.4: Crime in United States
(slide 4 of 6)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.4: Crime in United States
(slide 5 of 6)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.5: The DJIA Index
(slide 1 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Example 2.5: The DJIA Index
(slide 2 of 2)
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
In-class Lab 3 & 4
Lab 3
• Charts for Numerical Variables exercise
• Excel: StatTools (also see StatTools Tutorial)
• Histogram & box plots
• Data set: Baseball Salaries.xlsx
Lab 4
• Time series data exercise
• Excel: StatTools
• Time series graph
• Data set: Crime in US.xlsx
Class Summary
To do
• Quiz 0
• Reading Assignments (on Syllabus)
• Homework (#38, 41, 48; pp. 73-76)
• Say hello to Jove (optional)
2-7 Excel® Tables for Filtering, Sorting, and Summarizing
Sorting
Summarizing
© 2017 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.