Академический Документы
Профессиональный Документы
Культура Документы
Chapter 2
Outline
2-1 Review and Preview
2-2 Frequency Distributions
2-3 Histograms
2-4 Graphs That Enlighten and Graphs
That Deceive
Center
A representative value that indicates where the
middle of the data set is located
Variation
A measure of the amount that the data values
vary
Spring 2017 Math 115 / Statistics Chapter 2 / Page 4
Review and Preview
Characteristics of Data
Distribution
The nature or shape of the spread of data over the
range of values
Bell-shaped, uniform, or skewed
Outliers
Sample values that lie very far away from the vast
majority of other sample values
Time
Changing characteristics of the data over time
Spring 2017 Math 115 / Statistics Chapter 2 / Page 5
Review and Preview
Time
In the US 100 years ago
8% of homes had a telephone
14% of homes had a bathtub
The mean life expectancy was 47 years
The mean hourly wage was 22 cents
There were approximately 230 murders in the entire
US
Statistical analysis should always consider
changing population characteristics
Spring 2017 Math 115 / Statistics Chapter 2 / Page 6
Review and Preview
Be careful of what you believe
Why do you believe what you believe?
Sources?
Legitimate?
Questionable?
Agenda?
https://www.ted.com/talks/hans_and_ola_rosli
ng_how_not_to_be_ignorant_about_the_worl
d
Spring 2017 Math 115 / Statistics Chapter 2 / Page 7
Frequency Distributions
When working with large data sets
It is often helpful to organize and summarize
data by constructing a table called a
frequency distribution
Because computer software and calculators
can generate frequency distributions
The details of constructing them are not as
important as what they tell us about data sets
Still need to know how to derive them!!
Spring 2017 Math 115 / Statistics Chapter 2 / Page 8
Frequency Distributions
Frequency Distribution (or Frequency
Table)
Shows how a data set is partitioned among
all of several categories (or classes) by
listing all of the categories along with the
number (frequency) of data values in each
of them
50-69 2
70-89 33
Lower Class Limits 90-109 35
110-129 7
130-149 1
Spring 2017 Math 115 / Statistics Chapter 2 / Page 12
Frequency Distributions
Upper Class Limits
The largest numbers that can belong to a
class
IQ Score Frequency
50-69 2
70-89 33
Upper Class Limits 90-109 35
110-129 7
130-149 1
Spring 2017 Math 115 / Statistics Chapter 2 / Page 13
Frequency Distributions
Class Boundaries
The numbers used to separate classes, but
without the gaps created by class limits
IQ Score Frequency
49.5
50-69 2
69.5
70-89 33
Class Boundaries 89.5 90-109 35
109.5
110-129 7
129.5
130-149 1
149.5
Spring 2017 Math 115 / Statistics Chapter 2 / Page 14
Frequency Distributions
Class Midpoints
The values in the middle of the classes and can be
found by averaging the lower and upper limits
Adding the class upper limit to the class lower limit and
dividing the sum by 2 IQ Score Frequency
(50 + 69) / 2 = 119 / 2 = 59.5 50-69 2
(70 + 89) / 2 = 159 / 2 = 79.5 70-89 33
(90 + 109) / 2 = 199 / 2 = 99.5 90-109 35
(110 + 129) / 2 = 239 / 2 = 119.5 110-129 7
(130 + 149) / 2 = 279 / 2 = 139.5 130-149 1
Spring 2017 Math 115 / Statistics Chapter 2 / Page 15
Frequency Distributions
Class Width
The difference between two consecutive
lower class limits or two consecutive lower
class boundaries IQ Score Frequency
70 50 = 20 50-69 2
90 70 = 20 70-89 33
110 90 = 20 90-109 35
130 110 = 20 110-129 7
150 130 = 20 130-149 1
Spring 2017 Math 115 / Statistics Chapter 2 / Page 16
IQ Score Frequency
50-69 2
70-89 33
90-109 35
130-149
7
50-69 2
70-89 33
90-109 35
130-149
7
50-69 2
70-89 33
90-109 35
130-149
7
50-69 2
70-89 33
90-109 35
130-149
7
50-69 2 2.6%
70-89 33 42.3%
90-109 35 44.9%
110-129 7 9.0%
Frequency Distributions
130-149 1 1.3%
Frequency Distributions
Days to Maturity for Short-Term Investments
Frequency Distribution
Columns 1 and 3
Relative-frequency Distribution
Columns 1 and 4
Frequency Distributions
Days to Tally Number of Frequency Relative Relative Relative
maturity investments frequency frequency frequency
(fraction) (decimal) (percentage)
30 D 39 III 3 3 3 / 40 0.0750 7.50%
40 D 49 I 1 1 1 / 40 0.0250 2.50%
50 D 59 IIIIIIII 8 8 8 / 40 0.2000 20.00%
60 D 69 IIIIIIIIII 10 10 10 / 40 0.2500 25.00%
70 D 79 IIIIIII 7 7 7 / 40 0.1750 17.50%
80 D 89 IIIIIII 7 7 7 / 41 0.1750 17.50%
90 D 99 IIII 4 4 4 / 40 0.1000 10.00%
40 40 40 / 40 = 1 1.0000 100.00%
Frequency Distribution
Columns 1 and 2
Relative-frequency Distribution
Columns 1 and 3
Cumulative Frequencies
50-69 2 2
70-89 33 35
90-109 35 70
110-129 7 77
130-149 1 78
50-69 2
70-89 33
90-109 35
110-129 7
130-149 1
18 16
16 14
14 12
12
Frequency
10
8
6
3
4 2 2
1
2
0
0 1 2 3 4 5 6
Number of TVs
0.35 0.320
0.30 0.280
Relative Frequency
0.240
0.25
0.20
0.15
0.10 0.060
0.020 0.040 0.040
0.05
0.00
0 1 2 3 4 5 6
Number of TVs
6
5 4
4 3
3
2 1
1
0
35 45 55 65 75 85 95
Days to maturity
Short-Term Investments
0.250
0.25
0.200
Relative Frequency
0.15
0.100
0.10 0.075
0.05 0.025
0.00
35 45 55 65 75 85 95
Days to maturity
Examples of skewness
Example
Heights of 3,264 female students
With bell shaped curve
Spring 2017 Math 115 / Statistics Chapter 2 / Page 66
Histograms
Cumulative
Cumulative Relative
IQ Score Frequency Relative
Frequency Frequency
Frequency
50 - 69 2 2 0.0256 0.0256
70 - 89 33 35 0.4231 0.4487
90 - 109 35 70 0.4487 0.8974
110 - 129 7 77 0.0897 0.9872
130 - 149 1 78 0.0128 1.0000
Total 78 1
Histograms
Low Lead Level - Sorted
50 56 70 72 73 74 75 76 76 76 76 76 77 77 78 80
80 80 84 85 85 85 85 86 86 86 86 87 87 88 88 88
89 89 89 91 92 93 94 94 94 95 96 96 96 96 96 96
96 97 97 98 99 99 99 99 100 101 101 102 104 104 105 105
106 107 107 107 107 108 111 115 115 118 120 125 128 141
Histograms
110 - 129 7 0.0897 8.97%
130 - 149 1 0.0128 1.28%
Total 78 1 100.00%
High Lead Relative
IQ Score Percent
Low Lead Level - Sorted Frequency Frequency
50 56 70 72 73 74 75 76 76 76 76 76 77 77 78 80 50 - 69 0.0000 0.00%
80 80 84 85 85 85 85 86 86 86 86 87 87 88 88 88 70 - 89 14 0.7000 70.00%
89 89 89 91 92 93 94 94 94 95 96 96 96 96 96 96 90 - 109 6 0.3000 30.00%
96 97 97 98 99 99 99 99 100 101 101 102 104 104 105 105 110 - 129 0.0000 0.00%
106 107 107 107 107 108 111 115 115 118 120 125 128 141 130 - 149 0.0000 0.00%
Total 20 1 100.00%
Histograms
110 - 129 7 0.0897 8.97%
130 - 149 1 0.0128 1.28%
Total 78 1 100.00%
High Lead Relative
IQ Score Percent
Low Lead Level - Sorted Frequency Frequency
50 56 70 72 73 74 75 76 76 76 76 76 77 77 78 80 50 - 69 0.0000 0.00%
80 80 84 85 85 85 85 86 86 86 86 87 87 88 88 88 70 - 89 14 0.7000 70.00%
89 89 89 91 92 93 94 94 94 95 96 96 96 96 96 96 90 - 109 6 0.3000 30.00%
96 97 97 98 99 99 99 99 100 101 101 102 104 104 105 105 110 - 129 0.0000 0.00%
106 107 107 107 107 108 111 115 115 118 120 125 128 141 130 - 149 0.0000 0.00%
Total 20 1 100.00%
Disadvantages
Not useful for a large data set
Awkward with data containing many digits
32.5%
22.5%