Академический Документы
Профессиональный Документы
Культура Документы
than it is now?
Symbols
process data
Inferential statistics
Using data to make predictions, forecasts, and estimates to assist
decision making
Descriptive Statistics
Collect data
e.g., Survey
Present data
X
n
Inferential Statistics
Estimation
e.g., Estimate the
population mean weight
using the sample mean
weight
Hypothesis testing
e.g., Test the claim that
the population mean
weight is 140 pounds
Inference is the process of drawing conclusions or making decisions
about a population based on sample results
Types of Data
Examples:
Marital Status
Are you registered to
vote?
Eye Color
(Defined categories or
groups)
Examples:
Number of Children
Defects per hour
(Counted items)
Examples:
Weight
Voltage
(Measured characteristics)
Measurement Levels
Differences between
measurements, true
zero exists
Ratio Data
Quantitative Data
Differences between
measurements but no
true zero
Interval Data
Ordered Categories
(rankings, order, or
scaling)
Ordinal Data
Categories (no
ordering or direction)
Nominal Data
Qualitative Data
Measurement Levels
Variables can be split into categorical and continuous, and within these types there
Categorical
Nominal
The lowest scale
Numbers assigned to identify attributes
No order/ sequence
Ordinal scale
The same as a nominal variable but the
categories have a logical order
Arrange from lowest to highest or vice versa
Continuous variable
Interval Scale
Equal intervals on the variable represent equal
differences in the property being measured
Arbitrary zero
Ratio Scale
Same as an interval variable, equal intervals on the
variable represent equal differences in the property being
measured
True zero
received.
Nationality
Perception scores
Quality of work life
scores.
Income categories
Musical ability
9. Favorite
Food
10. Speaking Ability
9.
making
Some type of organization is needed
Table
Graph
The type of graph to use depends on the variable being
summarized
Numerical
Variables
Frequency distribution
Histogram and ogive
Stem-and-leaf display
Scatter plot
Categorical
Data
Tabulating Data
Frequency
Distribution
Table
Graphing Data
Bar Chart
Pie Chart
Hospital Unit
Cardiac Care
Emergency
Intensive Care
Maternity
Surgery
(Variables are
categorical)
Number of Patients
1,052
2,245
340
552
4,630
(category) data
Height of bar or size of pie slice shows the frequency or
Number
of Patients
Cardiac Care
1,052
Emergency 2,245
Intensive Care
340
Maternity
552
Surgery
4,630
Number
of Patients
Cardiac Care
1,052
Emergency 2,245
Intensive Care
340
Maternity
552
Surgery
4,630
% of
Total
11.93
25.46
3.86
6.26
52.50
(Percentages
are rounded to
the nearest
percent)
Histogram
Ogive
Stem-and-Leaf
Display
Frequency Distributions
What is a Frequency Distribution?
A frequency distribution is a list or a table
useful form...
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
Interval
Frequency
Percentage
Relative
Frequency
0.15
0.30
0.30
0.25
0.25
0.20
0.20
0.10
0.10
20
100
TOTAL
0.15
Histogram
A graph of the data in a frequency distribution is called a
histogram
The interval endpoints are shown on the horizontal axis
The vertical axis is either frequency, relative frequency,
or percentage
Bars of the appropriate heights are used to represent the
Histogram Example
Interval
10 but less than 20
20 but less than 30
30 but less than 40
40 but less than 50
50 but less than 60
Frequency
3
6
5
4
2
(No gaps
between
bars)
Temperature in Degrees
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58
Class
Frequency Percentage
Cumulative Cumulative
Frequency Percentage
15
30
25
3
9
14
20
10
45
70
18
20
100
15
90
100
Upper
interval Cumulative
endpoint Percentage
10
20
30
40
50
60
0
15
45
70
90
100
Interval endpoints
Stem-and-Leaf Diagram
A simple way to see distribution details in a data set
Example
Data in ordered array:
21, 24, 24, 26, 27, 27, 30, 32, 38, 41
21 is shown as
38 is shown as
Example
(continued)
Leaves
1 4 4 6 7 7
0 2 8
Stem
Leaf
12
...
1224 becomes
Data:
Stem
Leaves
6
7
8
9
10
136
2258
346699
13368
356
11
12
47
2
variable
When two variables exist other techniques are used:
Categorical
(Qualitative)
Variables
Numerical
(Quantitative)
Variables
Cross tables
Scatter plots
Scatter Diagrams
Scatter Diagrams are used for paired
Cost per
day
23
125
26
140
29
146
33
160
38
167
42
170
50
188
55
195
60
200