Академический Документы
Профессиональный Документы
Культура Документы
1 / 48
Overview
2 / 48
Mean
is the sum of all
The (arithmetic) sample mean X
observations divided by the number of observations:
=
X
Pn
i=1 Xi
3 / 48
Median
Another measure of the average value is the sample median
=
X
Y(n+1)/2
if n is odd
,
1
(Y
+
Y
)
if n is even
n/2
n/2+1
2
Mode
The mode is the most frequently occuring value in the sample
4 / 48
1
2.75
2
2.86
3
3.37
4
2.76
5
2.62
6
3.49
7
3.05
8
3.12
5 / 48
Choice of measure
The choice of measure for the average value depends on the
type of distribution
Average value
arithmetic mean
median
mode
Distribution
usually the preferred measure
one or two extremely high or low values
seldom used
The mean, median and mode are equal when the distribution
6 / 48
Variation
Measures of variation are used to indicate the spread of the values
in a distribution
7 / 48
two quartiles:
IQR = Q3 Q1 ,
where Q1 and Q3 are the lower and upper quartiles
respectively. It indicates the spread of the middle 50% of the
distribution
8 / 48
Variance
The population variance 2 may be estimated by the
s =
Pn
)2
X
,
n1
i=1 (Xi
9 / 48
Standard deviation
The population standard deviation is found as the square
s=
)2
X
=
n1
n
i=1 (Xi
sP
n
2
i=1 Xi
( ni=1 Xi )2 /n
n1
P
10 / 48
Totals
Plasma volume
X
2.75
2.86
3.37
2.76
2.62
3.49
3.05
3.12
Deviation
from the mean
X X
-0.25
-0.14
0.37
-0.24
-0.38
0.49
0.05
0.12
Squared deviation
from the mean
)2
(X X
0.0625
0.0196
0.1369
0.0576
0.1444
0.2401
0.0025
0.0144
Squared
observation
X2
7.5625
8.1796
11.3569
7.6176
6.8644
12.1801
9.3025
9.7344
24.02
0.00
0.6780
72.7980
0.6780
7
= 0.31
11 / 48
Standard error
also has a distribution!
X
I mean equal to the population mean
by
s
sd
.e. = sX = ,
n
12 / 48
13 / 48
the population
the standard error of the sample mean measures the amount
14 / 48
15 / 48
Pn
i=1
Xi
) =
Expectation of the mean: E (X
2
) =
Variance of the mean: Var (X
n
Standard deviation of the mean = standard error:
) =
SD(X
N(, )
The distribution of the mean: X
n
(the central limit theorem)
16 / 48
/ n
N(0, 1)
X
t(n 1),
s/ n
17 / 48
19 / 48
Population
Sample
6
20 / 48
21 / 48
then given by
+ 1.96 s ,
1.96 s , X
95% CI = X
n
n
22 / 48
23 / 48
24 / 48
Small-sample case
In a small-sample case when the sample size, n, is not large,
we have that:
I
when the sample size is very small, say n < 15, and when the
distribution in the population is extremely non-normal. The
former effect invalidates the use of the normal distribution,
and instead we use the t distribution in calculating the
confidence intervals
25 / 48
26 / 48
1
2.2
2
2.4
3
4.9
4
2.5
5
3.7
6
4.3
27 / 48
The 95% confidence interval (in hours) for the average number
of hours of relief for arthritic patients in general is
(3.3 2.57 0.46, 3.3 + 2.57 0.46) = (2.1, 4.5),
where 2.57 is the two-sided 5% point of the t distribution with 5
degrees of freedom
28 / 48
Severe non-normality
When the distribution in the population is markedly non-normal, it
may be desirable to
use a transformation on the scale on which the variable X is
measured, or
calculate a non-parametric confidence interval, or
use bootstrap methods
29 / 48
Since the sample size n > 1, the confidence interval will always be
narrower than the reference range.
30 / 48
31 / 48
Background
32 / 48
T0 =
0
X
t(n 1),
s/ n
33 / 48
automatically
Look up in a table...
34 / 48
35 / 48
newborns:
36 / 48
37 / 48
Paired data
Paired measurements
In medical settings we often deal with paired measurements,
38 / 48
Hours of sleep
Drug
Placebo
6.1
5.2
6.0
7.9
8.2
3.9
7.6
4.7
6.5
5.3
5.4
7.4
6.9
4.2
6.7
6.1
7.4
3.8
5.8
7.3
1 = 6.66
X
0 = 5.58
X
Difference
0.9
-1.9
4.3
2.9
1.2
-2.0
2.7
0.6
3.6
-1.5
= 1.08
X
39 / 48
= 1.08 hours,
The observed mean difference in sleep time was X
and the empirical standard deviation of the differences was
s = 2.31. The
estimated standard error of the differences is
40 / 48
41 / 48
SPSS
1
42 / 48
43 / 48
Click on OK
44 / 48
45 / 48
46 / 48
5433,2kJ
Can we conclude that there is a difference between pre and
post?
47 / 48
Summary
Key words
Sample mean
Sample standard deviation
Standard error vs. standard deviation
The central limit theorem
The student t-distribution
Confidence intervals for the mean
One sample t-test (also for paired data)
Notation
and
X
s and
48 / 48