Академический Документы
Профессиональный Документы
Культура Документы
Subject CT3
Contents
If you think that any pages are missing from this pack, please contact ActEd’s admin team
by email at ActEd@bpp.com.
Guidance on how and when to use the Mock Exam A Solutions is set out
in the Study Guide for the 2018 exams.
This study material is copyright and is sold for the exclusive use of the purchaser. You
may not hire out, lend, give out, sell, store or transmit electronically or photocopy any
part of it. You must take care of your material to ensure that it is not used or
copied by anybody else. By opening this pack you agree to these conditions.
Markers: This document sets out one approach to solving each of the questions. Please
give credit for other valid approaches.
Exact figures have been used for intermediate calculations. If rounded values are used
then slightly different answers may be obtained.
Solution 1
A Type I error occurs when H 0 is rejected when it is true. The test is of the form
“Reject H 0 if X 25 ”. So we want:
X
Now X ~ N ( , 2 n) and 2 100 . Hence Z ~ N 0,1 [½]
100 n
25 20 n
P X 25 when 20 P Z PZ 2 [½]
100 n
Setting this probability equal to 0.02 and using page 162 of the Tables, we get:
n
2.0537 [½]
2
Solving this we get n 16.87 . So the required minimum sample size is 17. [1]
[Total 3]
Solution 2
(i) Definition
U n1
V n2
where U ~ n21 and V ~ n22 and U and V are independent random variables. [1]
Dividing by the degrees of freedom and then dividing the random variables, we get:
(n1 1) S12
(n1 1)
12 S12 12
~ Fn1 1, n2 1 [1]
(n2 1) S 22 S22 22
(n2 1)
22
We have achieved the desired result since this corresponds to the definition of the F
(n1 1) S12 (n2 1) S 22
distribution in part (i), with U ,V and n1 and n2 replaced by
12 22
n1 1 and n2 1 .
(iii) Probability
We have n1 11 , n2 19 , 12 22 2 , so:
S12
~ F10,18 [1]
S 22
S2
P 12 2.9 P( F10,18 2.9)
S2
which (from page 173 of the Tables) is just less than 2.5% (by interpolation it is
approximately 2.4%). [1]
[Total 2]
Solution 3
We require:
So, by interpolation:
So, by interpolation:
0.95 P( X 0) P( X 1) e 1 1e 1
0.05 P( X 0) P( X 1) P( X 2)
22
e 2 2 e 2 e 2
2
Solution 4
Course reference: Chapter 6 Sections 1.3 and 1.4 and Chapter 7 Section 1.
(i) Independent
ie the joint density function factorises. This is clearly not the case and so U and V are
not independent. [½]
Do be aware that whilst we can use E (UV ) E (U ) E (V ) to show that random variables
are not independent, the converse result E (UV ) E (U ) E (V ) does not necessarily
imply that U and V are independent (it is a necessary not a sufficient condition – they
could be just uncorrelated). [Total 1]
fU ,V (u , v)
fU |V v (u , v) [½]
fV (v)
fV (v) fU ,V (u, v) du
1
1
(3u 5v) du
0
4
1
1 3 2
u 5uv
4 2 0
13 3 5
5v v [1]
4 2 8 4
Note to markers: if this was determined in part (i) full credit should be given.
Therefore:
1 (3u 5v )
fU |V v (u, v) 4
1 3 5v
4 2
3u 5v 6u 10v
3 5v
or 0 u 1 [½]
2
3 10v
[Total 2]
E (U | V v) u fU |V v (u , v) du
1
3u 5v
u 3 5v
du [1]
0 2
1
1
3
5v 0
3u 2 5uv du
2
1 3 5 2 1 1 52 v 2 5v
3 5v
u 2u v 3 [1]
0 5v 3 10v
2 2
[Total 2]
Solution 5
Hence:
480
x 9.6 [½]
50
1 2 1
s2 4,800 50 9.6 49 192 3.9184
[1]
49
s 3.9184 1.979 [½]
[Total 3]
The mean would decrease as we have added lower values to the data set. [½]
The standard deviation would increase, as the data set is now spread out over a wider
range (3 to 12 instead of 8 to 12). [½]
The data set will become more negatively skewed as more of the data values are on the
negative (left) side of the mode (or median or mean). [1]
[Total 2]
For comparison:
Solution 6
X
~ tn 1
S n
X
P 2.093 2.093 0.95
S 20
S S
X 2.093 X 2.093
20 20
s
x 2.093 [1]
n
We have:
52 48.2
x 2.6 and s 2 2.5368 [1]
20 19
(n 1) S 2
2
~ n21
19S 2
P 8.907 2 32.85 0.95 [1]
19S 2 19S 2
2
32.85 8.907
Solution 7
Let X denote the number of boys in a sample of 4 children from the population. Then
X ~ Bin 4,½ and:
P X 2 P X 2 P X 3 P X 4
2 2 3 1 4 0
4 1 1 4 1 1 4 1 1
0.6875 [1]
2 2 2 3 2 2 4 2 2
Now let Y denote the number of samples out of 100 with at least 2 boys. Then
Y ~ Bin 100, 0.6875 and this distribution can be approximated by the normal
distribution with the same mean and variance. We have:
59.5 68.75
P Y 60 P N 0,1 [1]
21.484375
P N 0,1 1.99563
P N 0,1 1.99563
0.977 [1]
[Total 4]
Markers: please deduct a mark if the continuity correction has not been used. Without
the correction, the answer is (1.88776) 0.970 .
Solution 8
P( L T )
We want P( L | T ) . Drawing a probability tree:
P(T )
T
0.6
E
0.7
0.4 NT
T
0.55
0.3
L
0.45
NT
We have:
0.165
P( L | T ) 0.282 [1]
0.585
[Total 3]
The probabilities of getting to London on time for work given the trains are:
Since the tree diagram would be messy, it would be quicker to use Bayes’ Theorem.
We require P (T | M ) so using the formula from page 5 of the Tables, we get:
P(T | M ) P( M )
P(M | T ) [1]
P(T | E ) P( E ) P(T | M ) P( M ) P(T | L) P ( L)
0.57 0.2
[1]
(0.6 0.7) (0.57 0.2) (0.55 0.1)
0.194 [1]
[Total 3]
Solution 9
M S (t ) M N ln M X (t )
Since:
and:
t
M X (t ) 1 (1 2t ) 74 [½]
we have:
M S (t ) M N ln M X (t ) exp 204 (1 2t ) 74 1
[½]
Hence:
CS (t ) ln M S (t ) 204 (1 2t ) 74 1 [½]
[Total 2]
We first need to find the skewness, since, by definition, the coefficient of skewness is:
skew[ S ]
[½ ]
var[S ]1.5
Differentiating the cumulant generating function:
Hence:
skew[ S ] 688,377,600
0.0714 [½ ]
var[S ]1.5
4,528,8001.5
[Total 4]
Solution 10
Course reference: Chapter 6 Section 4.4, Chapter 4 Sections 3.2 and 2.2.
1
t
M X i (t ) 1 [½]
n
Consider the MGF of Xi :
i 1
Since this is the MGF of a gamma (n, ) random variable, by the uniqueness property of
n
MGFs, we can say that X i ~ gamma(n, ) . [½]
i 1
[Total 3]
The waiting time in days to the fourth claim is T1 T2 T3 T4 , where the Ti ’s are
independent and Ti ~ Exp (10) .
We require P ( S 0.5) . Using the fact that if X ~ gamma ( , ) then 2 X ~ 22 : [½]
Alternatively, we could calculate the probability that there are fewer than 4 claims in
half a day. N ~ Poi (5) is the number of claims in half a day. Hence, using the Poisson
tables:
P ( S 0.5) P( N 4) P( N 3) 0.265
Solution 11
Course reference: Chapter 3 Sections 3.1 and 3.2. Chapter 10 Sections 1.1 and 2.1.
The density function is a straight line. The area under the line must be equal to 1. So if
it crosses the vertical axis at (0, h ), we have:
½ h 1
2
which gives h . Since the line goes through the points (0, 2 ) and ( , 0) , the
gradient of the line is:
2 0 2
2 [1]
0
Using the equation y mx c (the equation of a straight line with gradient m and
y -intercept c ), we have:
2 2 2 2 x 2( x)
f ( x) x , 0 x [1]
2 2
2
[Total 2]
We have:
2( x) 1 2 2 x3
E( X ) x dx 2 x [1]
0
2 3 0 3
and:
2( x) 1 2 x3 2 x 4 2
E( X ) x
2 2
dx 2 [1]
0
2 3 4 0 6
2 2 2
var( X ) [1]
6 9 18
[Total 3]
ˆ
3.04 [1]
3
If the actual value of was 9.12, this would give a probability of zero of obtaining a
sample value of 10. [1]
Taking logs:
d 1 10
log L [1]
d xi
d 1 1 1 1 1 10
d log L 12 11.2 10.7 10.6 10.3 2 12 0.0408 [½]
d 1 1 1 1 1 10
d log L 12.5 11.7 11.2 11.1 10.8 2.5 12.5 0.0426 [½]
So there is a value for between 12 and 12.5 for which the derivative of the log of the
likelihood function is zero. [½]
0 0.0408
12 (12.5 12) 12.2 [½]
0.0426 0.0408
d2 1 10
log L 2 [½]
d 2
xi
2
and:
d2 1 1 1 1 1 10
2 log L 2
2
2
2
2
d 12.2 11.4 10.9 10.8 10.5 2.2 12.22
0.173 [½]
Since the first derivative is zero and the second derivative is negative when 12.2 ,
the maximum likelihood estimate of is 12.2. [½]
[Total 6]
Solution 12
Course reference: Chapter 13 Sections 1, 3.2, 3.5, 3.3, 2.2 and 3.6, respectively.
(i) Comment
The scores look like they follow a reasonably straight line, so the model appears
suitable. [1]
2
639
s xx x 2 nx 2 45,507 10 4,674.9 [½]
10
639 622
s xy xy n x y 43, 205 10 3, 459.2 [½]
10 10
s xy 3, 459.2
ˆ 0.740 [1]
s xx 4,674.9
H 0 : 0.5 vs H1 : 0.5
We know that:
ˆ
~ tn 2
ˆ 2 S xx
We need to calculate ˆ 2 :
1
2
s xy 1 3, 459.22
ˆ2
s yy 4,669.6 263.74 [1]
n 2 s xx 8 4,674.9
2
622
where s yy 43,358 10 4,669.6 .
10
0.740 0.5
1.01 [1]
263.74
4,674.9
From page 163 of the Tables, P (t8 1.01) 15% , so we have insufficient evidence to
reject H 0 at the 5% level. So it is reasonable to assume that the gradient parameter is
0.5. [1]
Alternatively, the value of the test statistic is less than the 5% critical value of 1.86, so
we have insufficient evidence to reject H 0 at the 5% level.
[Total 3]
2
2 s xy 3, 459.22
R 0.548 54.8% [1]
s xx s yy 4,674.9 4,669.6
This value is low which indicates that the model is not a good fit to the data. This is due
to the large spread of the scores around the regression line. [1]
[Total 3]
H 0 : 0.9 vs H1 : 0.9
zr z
~ N (0,1)
1 n3
1
where zr tanh 1 r 12 ln 11 rr and z tanh 1 12 ln 1 .
This is greater than the 5% critical value of 1.960 , so we have insufficient evidence to
reject H 0 at the 5% level. Therefore we conclude that 0.9 . [1]
[Total 3]
This means we have a p -value of 2 8.41% 16.82% , which is clearly not significant.
This part of the question uses the “individual response” section of the notes. From the
notes:
yˆ0 y0
~ tn 2 [1]
1 ( x0 x ) 2 2
1 n S ˆ
xx
(Markers, please award ½ for recognising the “individual response” and ½ for the
t distribution.)
1 (50 63.9) 2
51.9 2.306 1 263.74 [1]
10 4, 674.9
Solution 13
Course reference: Chapter 14 Sections 1.2, 1.3, and 2.3. Chapter 11 Section 3.2.
We have:
1 1
ˆ y yij
n i j
ˆ
18
117 6.5 [1]
1 1
ˆi yi y
ni
yij n yij
j i j
So:
27
ˆ1 6.5 0.25 [½]
4
15
ˆ2 6.5 1.5 [½]
3
34
ˆ3 6.5 0.83 [½]
6
41
ˆ4 6.5 1.7 [½]
5
[Total 3]
(ii) ANOVA
We assume the observations are from normal populations with the same variance. [1]
H0 : Each company has the same mean number of claims per month
H1 : There are differences between the mean numbers of claims received per month
by the different companies [½]
We have:
1 2 117 2
SST yij2 y 843 82.5 [1]
n 18
8.54
The variance ratio is F 2.10 . [1]
4.06
Under H 0 , this has an F3,14 distribution. The 5% critical point is 3.344, so we have no
evidence to reject H 0 , and we conclude that there is no difference between the mean
number of claims received by each company. [1]
[Total 7]
SS R 56.88
ˆ 2 4.063 [1]
nk 14
In other words, the estimate of the error variance is the residual mean square. This is
an entry in the ANOVA table, so we could just write it down from there.
Now:
(n k )ˆ 2 SS R
2
~ n2 k or 2
~ n2 k [½]
2
We have a 14 distribution, so:
SS
P 5.629 2R 26.12 0.95 [½]
SS R SS R
2
26.12 5.629
56.88 56.88
, 2.18,10.10 [1]
26.12 5.629
[Total 3]
27
y1 6.75
4
15
y2 5
3
34
y3 5.6
6
41
y4 8.2 [½]
5
So we have:
y2 y3 y1 y4 [½]
Comparing y2 and y3
1 1 1 1
t14, 2.5% ˆ 2 2.145 4.063 3.057
n2 n3 3 6
y2 y3 y1 y4
Comparing y2 and y1
1 1 1 1
t14, 2.5% ˆ 2 2.145 4.063 3.302
n2 n1 3 4
y2 y3 y1 y4
Comparing y2 and y4
1 1 1 1
t14, 2.5% ˆ 2 2.145 4.063 3.157
n2 n4 3 5
Comparing y3 and y4
1 1 1 1
t14, 2.5% ˆ 2 2.145 4.063 2.618
n3 n4 6 5
y2 y3 y1 y4 [1]
___________
[Total 5]
Least significant
Mean Absolute difference Not significant
difference
y2 y3 0.6 3.057
y2 y1 1.75 3.302
y2 y4 3.2 3.157
y3 y1 1.083 2.791
y3 y4 2.53 2.618
y1 y4 1.45 2.900
Award 1 mark for the first two tests and ½ mark for the next two tests.