Академический Документы
Профессиональный Документы
Культура Документы
C
I
S
PHY @ HSC
Stephen Bosi
John OByrne
Peter Fletcher
Joe Khachan
Jeff Stanger
Sydney, Melbourne, Brisbane, Perth, Adelaide
and associated companies around the world
Sandra Woodward
Contents
Series features
How to use this book
Stage 6 Physics syllabus grid
vi
viii
x
Module 1 Space
Module 1 Introduction
Chapter 1 Cannonballs, apples, planets and gravity
1.1 Projectile motion
1.2 Gravity
1.3 Gravitational potential energy
Practical experiences
Chapter summary
Review questions
2
4
4
10
16
20
22
22
58
58
61
64
69
75
76
76
Module 1 Review
80
82
100
100
104
106
109
110
110
114
114
120
121
126
127
127
130
130
136
141
148
149
149
Module 2 Review
152
154
156
156
160
164
165
167
170
171
171
174
174
178
182
184
185
186
187
iii
Contents
188
189
190
193
197
201
202
202
Chapter 11 Superconductivity
11.1 The crystal structure of matter
11.2 Wave interference
11.3 X-ray diffraction
11.4 Crystal structure
11.5 Electrical conductivity and the crystal
structure of metals
11.6 The discovery of superconductors
11.7 The Meissner effect
11.8 Type-I and type-II superconductors
11.9 Why is a levitated magnet stable?
11.10 BCS theory and Cooper pairs
11.11 Applications of superconductors
Practical experiences
Chapter summary
Review questions
204
204
205
207
208
Module 3 Review
224
209
211
212
212
213
215
217
220
221
221
226
247
247
248
250
251
252
253
256
256
257
260
260
261
262
263
265
266
268
269
270
272
273
274
275
279
279
280
282
286
292
295
296
297
Module 4 Review
300
302
304
304
305
308
310
312
315
317
318
318
320
320
321
322
324
326
326
329
330
331
331
Contents
333
333
336
338
339
339
340
340
343
354
354
355
357
359
360
362
363
364
364
Module 5 Review
366
344
346
347
350
351
351
Module 6 Astrophysics
Module 6 Introduction
368
370
370
373
374
377
380
382
383
384
384
388
388
389
394
397
400
403
405
405
407
407
411
413
415
418
418
419
422
422
423
425
428
430
433
435
436
436
Module 6 Review
438
Module 7 Skills
Module 7 Introduction
440
442
442
443
445
446
447
447
448
449
Numerical answers
Glossary
Index
Acknowledgements
Formulae and data sheets
Periodic table
452
454
465
471
473
474
S
C
I
S
Y
PH
@ HSC
AGE FOR NSW STUDENTS
CK
PA
S
IC
YS
PH
E
ET
PL
M
CO
THE
in2 Physics is the most up-to-date physics package written for the NSW Stage 6 Physics syllabus. The
materials comprehensively address the syllabus outcomes and thoroughly prepare students for the HSC exam.
Physics is presented as an exciting, relevant and fascinating discipline. The student materials provide
clear and easy access to the content and theory, regular review questions, a full range of exam-style
questions and features to develop an interest in the subject.
From ideas to
on
implementati
From
cathode rays
to television
glass
anode (positive)
electrons
'boil' off
the heated
cathode
collimator
electron
beam
heater
cathode (negative)
Figure 8.5.3
electrons attracted
to the positive anode
Television
electron gun
magnetic
coils
fluorescent screen
Figure 8.5.5
mask
blue
beam
electron
guns
red
beam
R
G
B
green
beam
focusing
coils
Time
Figure 8.5.4
Time
mask
fluorescent
screen
fluorescent
screen
electron
beams
holes in
mask
Figure 8.5.6
guns
A colour CRT television set has three electron
that will only strike their respective coloured
phosphor dots with the aid of a shadow mask.
168
phosphor dots
on screen
vacuum
electron
beam
deflecting
coils
try this!
Do not aDjust your
horizontal!
If you have access to an old
black and white TV set or an old
style monochrome computer
monitor, try holding a bar
magnet near the front of the
screen and watch how the
image distorts. This occurs
because the magnetic field
deflects the electrons that strike
the screen. DO NOT do this with
used
Can an osCillosCope be
as a television set?
CheCkpoint 8.5
1
2
3
4
MODULE
A write-in workbook
that provides a
structured approach
to the mandatory
practical experiences,
both first-hand and
secondary-source
investigations.
Dot point and skills
focused.
from ideas to
tion
implementa
Chapter 8
Changing pressure of
discharge tubes
Physics skills
Aim
Equipment
them.
Risk assessment
Method
1
2
3
4
8.1.1.
Set up the equipment as shown in Figure
in the tube.
Observe the patterns and note the pressure
series.
Replace the tube with the next in the
and
Repeat the process of observing the patterns
your set.
in
tubes
the
noting the pressure for each of
DC power supply
tube
Figure 8.1.1 Induction coil and discharge
Hypothesis
HAZARD
Theory
69
68
2
Context
Motors and
Generators
l v = l0 1
tv =
mv =
Figure 4.0.2
try thiS!
CheCkpoInT 11.1
Figure 11.2.1 Two identical waves (red, green) travelling in opposite directions can add (blue)
t=1s
The interference of identical waves from two sources can also be represented
by outwardly radiating transverse waves (see Figure 11.2.3). The distance that a
twave
= 3 s travels is known as its path length. t = 6 s Constructive interference occurs
when the difference in the path length of the two waves is equal to 0, , 2, 3,
4 or any other integer multiple of the wavelength . Destructive interference
occurs
when the two waves are half a wavelength out of step. This corresponds to
t=4s
t=7s
a path length difference of /2, 3/2, 5/2 etc.
t=5s
lines of destructive
interference
lines of constructive
interference
t=4s
t=0s
waves
in phase
destructive
interference
constructive
interference
204
t=7s
1
2
evil tWinS
1 v2
1
m0c 1 + 2 = m0c 2 + m0v 2
2
2 c
E = mc 2
Figure 3.4.6
where m is any kind of mass. In relativity, mass and energy are regarded as the
same thing, apart from the change of units. Sometimes the term mass-energy is
used for both. m0 c 2 is called the rest energy, so even a stationary object contains
energy due to its rest mass. Relativistic kinetic energy therefore:
m0c 2
mv c 2 m0c 2 =
m0c 2
v2
1 2
c
Whenever energy increases, so does mass. Any release of energy is
accompanied by a decrease in mass. A book sitting on the top shelf has a slightly
higher mass than one on the bottom shelf because of the difference in
gravitational potential energy. An objects mass increases slightly when it is hot
because the kinetic energy of the vibrating atoms is higher.
Because c 2 is such a large number, a very tiny mass is equivalent to a large
amount of energy. In the early days of nuclear physics, E = mc 2 revealed the
enormous energy locked up inside an atoms nucleus by the strong nuclear force
that holds the protons and neutrons together. It was this that alerted nuclear
physicists just before World War II to the possibility of a nuclear bomb. The
energy released by the nuclear bomb dropped on Hiroshima at the end of that
war (smallish by modern standards) resulted from a reduction in relativistic mass
of about 0.7 g (slightly less than the mass of a standard wire paperclip).
Worked example
qUESTIon
When free protons and neutrons become bound together to form a nucleus, the reduction in
nuclear potential energy (binding energy) is released, normally in the form of gamma rays.
Relativity says this loss in energy is reflected in a decrease in mass of the resulting atom.
19
Imaging with
gamma rays
PRACTICAL EXPERIENCES
ChAPTER 19
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Perform an investigation to
compare a bone scan with
an X-ray image.
205
A bone scan is performed to obtain a functional image of the bones and so can be
used to detect abnormal metabolism in the bones, which may be an indication of
cancer or other abnormality. Because cancer mostly involves a higher than normal
rate of cell division (thus producing a tumour), chemicals
involved in metabolic processes in bone tend to accumulate in
higher concentrations in cancerous tissue. This produces areas
of concentration of gamma emission, indicating a tumour.
Compare the data obtained from the image of a bone scan
with that provided by an X-ray image.
Discussion questions
1 Identify the best part of the body for each of these
diagnostic tools to image.
2 Compare and contrast the two images in terms of
the information they provide.
Figure 19.6.1
Chapter summary
PHysicAlly sPeAking
Below is a list of topics that have been discussed throughout
this chapter. Create a visual summary of the concepts in
this chapter by constructing a mind map linking the terms.
Add diagrams where useful.
Radioactive
decay
350
Radiation
Radioisotope
Neutron
Proton
Beta decay
Gamma
decay
Antimatter
Bone scan
Positron
decay
Half-life
mEdICAL
PhySICS
Review questions
Figure 19.6.2
viii
73
constructive
interference
t=6s
1
2
72
Figure 11.1.1 Crystal structure of sodium chloride. The red spheres represent positive
sodium ions, and the green spheres represent negative chlorine ions.
1
Rearrange:
mvc 2 m0c 2 = (mv m0)c 2 m0v 2
2
In other words, at low speeds, the gain in relativistic mass (mv m0)
multiplied by c 2 equals the kinetic energya tantalising hint that at low speed
mass and energy are equivalent. It can also be shown to be true at all speeds,
using more sophisticated mathematics. In general, mass and energy are
equivalent in relativity and c 2 is the conversion factor between the energy unit
(joules) and the mass unit (kg). In other words:
c2
from ideaS to
implementation
v2
m0c 2 1 2
c
Surprising discovery
crystal, constructive interference,
destructive interference, path length,
diffraction grating, Bragg law,
phonons, critical temperature,
type-I superconductors,
type-II superconductors,
critical field strength, vortices,
flux pinning, BCS theory, Cooper pair,
coherence length, energy gap, spin
v2
= m0c 2 1 2
c
m0c 2
v2
1 2
c
Using a well-known approximation formula that you might learn at university,
(1 x )n 1 nx for small x:
InQUIRY ACtIVItY
mv c 2 =
m0
83
Superconductivity
c2
c2
82
11
v2
v2
PHYSICS FEATURE
Many of the devices you use every day have electric motors. They spin your DVDs,
wash your clothes and even help cook your food. Could you live without them,
and how much do you know about how they work?
The essential ingredients for a motor are a power source, a magnetic field
and things to connect these together in the right way. Its not as hard as you
think. All you need is a battery, a wood screw, a piece of wire and a cylindrical or
spherical magnet. Put these things together as shown in Figure 4.0.2 and see
if you can get your motor to spin. Be patient and keep trying. Then try the
following activities.
1 Test the effects of changing the voltage you use. You could add another
battery in series or try a battery with a higher voltage.
2 Try changing the strength of the magnet by using a different magnet or
adding another. What does this affect?
3 Try changing the length of the screw, how sharp its point is or the material
it is made from. Does it have to be made of iron?
t0
1
Space
How does this formula behave at low speeds (when v 2/c 2 is small)?
Figure 4.0.1
Seeing in a
weird light:
relativity
Isotope
reviewing
1
Nucleon
Alpha decay
PET
Table 19.6.1
Scintillator
Radioactive souRce
Radiation emitted
Half-life
C-11
Tc-99m
TI-201
I-131
Cs-137
U-238
+, g
g
g
, g
20.30 minutes
6.02 hours
3.05 days
8.04 days
30.17 years
4.47 109 years
351
Other features
from ideas to
implementation
4
extended response
III
II
multiple choice
(1 mark each)
1 Predict the direction of the electron in Figure 11.13.1
as it enters the magnetic field.
A Straight up
B Left
C Right
D Down
A
B
Figure 11.13.1
A
B
C
D
II
III
Conductor
Insulator
Insulator
Semiconductor
Insulator
Conductor
Semiconductor
Conductor
Semiconductor
Semiconductor
Conductor
Insulator
A
B
C
D
II
III
Critical
temperature
Superconductor
material
Critical
temperature
Normal material
Superconductor
material
Critical
temperature
Normal material
Normal material
Superconductor
material
II
Figure 11.13.2
Superconductor
material
Critical
temperature
II
III
Cathode
Striations
Anode
Cathode
Anode
Faradays
dark space
Striations
Faradays
dark space
10
11
12
Figure 11.13.3
Physics for FunTry This! activities are short, handson activities to be done quickly, designed to provoke
discussion.
Physics Features are a key feature as they highlight
contextual material, case studies or prescribed focus
areas of the syllabus.
III
II
Normal material
I
Striations
Faradays
dark space
Crookes
dark space
Cathode
Energy bands
Resistance ()
III
Temperature (K)
224
225
Practical experiences
The accompanying activity manual covers all of the
mandatory practical experiences outlined in the syllabus.
in2 Physics @ HSC Activity Manual is a write-in
workbook that outlines a clear, foolproof approach to
success in all the required practical experiences.
Within the student book, there are clear cross-references
to the activity manual: Practical Experiences icons refer to
the activity number and page in the activity manual. In
each chapter, a summary of possible investigations is
provided as a starting point to get
students thinking. These include
PRACTICAL
the aim, a list of equipment and
EXPERIENCES
Activity 10.2
discussion questions.
Activity Man
MODULE
ual, Page 94
Chapter 6
motors: magnetic fields
make the world go around
motors and
generators
aCtIVItY 6.2
First-hand investigation
Risk assessment
Method
Physics skills
Leave a straight piece (approx. 10 cm long) hanging out and then wind
the remainder of the wire around the box 2 times. Leave another
straight piece the same length as at the start, on the opposite side.
Wrap the straight pieces around the loops so that they tie both ends.
Fan out the loops so that you get equally spaced loops and that it
looks like a bird cage (see Figure 6.2.3).
Push out the middle of the paper clip as shown and Blu-Tack to
the bench.
Slip the straight pieces of wire through the paper clip supports.
Unwrap the cotton from these parts.
Place two magnets so that a north pole and a south pole face on
opposing sides of the cage.
Turn on. You may need to give the cage a tap to get it spinning.
Aim
Hypothesis
Theory
The motor effect means that a current-carrying wire experiences
a force when placed in a magnetic field. This is the basis for
the workings of a motor.
For a motor to work as needed, the motion resulting from
the motor effect needs to be circular and the force needs to be
adjusted so the direction of rotation does not change.
Question
Figure 6.2.1 shows the simplified workings of a motor that you
will be making. Label all the parts of the motor.
48
matchbox
wire
b
loop wire
through
paper clip
alligator clip
wires
power
source
How did adding more magnets affect how the motor ran?
Results
N
C:
D:
Blu-Tack
connecting wires with alligator clips
power supply
A:
B:
Equipment
ix
H2. analyses the ways in which models, theories and laws in physics
have been tested and validated
Focus: p. 79
Module 1 Space
1. The Earth has a gravitational field that exerts a force on objects both on it and around it
Students learn to:
Page Students:
13
Page
16
Act. 1.3
16
Act. 1.3
F = mg
to determine the weight force for a body on Earth and for the same body
on other planets
2. Many factors have to be taken into account to achieve a successful rocket launch, maintain
a stable orbit and return to Earth
Students learn to:
Page Students:
Page
7, 9,
23, 24
v = u + at
vy2 = uy2 + 2ayy
x = uxt
y = uyt +12 ayt2
18
identify data sources, gather, analyse and present information on the contribution 29
of one of the following to the development of space exploration: Tsiolkovsky,
Act. 2.1
Oberth, Goddard, Esnault-Pelterie, ONeill or von Braun
Act. 1.1
Stage 6
Physics syllabus grid
outline Newtons concept of escape
velocity
18
31
34
30, 33
25, 32, solve problems and analyse information to calculate the centripetal force acting
34, 37, on a satellite undergoing uniform circular motion about the Earth using:
54, 55
mv 2
F = r
43
define the term orbital velocity and the 36, 40, solve problems and analyse information using:
quantitative and qualitative relationship 56
r3
GM
=
between orbital velocity, the
2
4
2
T
gravitational constant, mass of the
central body, mass of the satellite and
the radius of the orbit using Keplers
Law of Periods
account for the orbital decay of
satellites in low Earth orbit
46
47
47
37, 54,
55
Act. 2.2
39, 43,
56
Page Students:
Page
13
present information and use available evidence to discuss the factors affecting
the strength of the gravitational force
Act. 1.3
11
23, 24,
25, 37,
54, 55
35, 38
44
xi
Stage 6
Physics syllabus grid
4. Current and emerging understanding about time and space has been dependent upon earlier models
of the transmission of light
Students learn to:
Page Students:
Page
62
62
Act. 3.2
62
58
60
Act. 3.1
58
66
65
analyse information to discuss the relationship between theory and the evidence
supporting it, using Einsteins predictions based on relativity that were made
many years before evidence was available to support it
78
65
79
v2
66, 69,
72, 77,
78
c2
t0
tv = 1
v2
c2
m0
mv = 1
discuss the implications of mass
increase, time dilation and length
contraction for space travel
v2
c2
70, 73
Page Students:
Page
92
Act. 4.1
xii
Stage 6
Physics syllabus grid
describe qualitatively and quantitatively 94
the force between long parallel currentcarrying conductors:
II
F
=k 1 2
l
d
94
115
92
Act. 4.1
90,
116
117
Act. 6.2
117
identify data sources, gather and process information to qualitatively describe the 91, 119
application of the motor effect in:
Act. 6.1
the galvanometer
the loudspeaker
115
115
2. The relative motion between a conductor and magnetic field is used to generate an electrical voltage
Students learn to:
Page Students:
100
Page
101
Act. 5.1
101
108
Act. 5.2
103
gather secondary information to identify how eddy currents have been utilised in
electromagnetic braking
Act. 5.2
113
105,
120
120
106
Page Students:
Page
131
Act. 5.1
135
135
Act. 7.1
135
141
Act. 7.2
146
Act. 7.3
147
xiii
Stage 6
Physics syllabus grid
4. Transformers allow generated voltage to be either increased or decreased before it is used
Students learn to:
Page Students:
Page
136
Act. 7.3
137
137
Act. 7.3
137
gather, analyse and use available evidence to discuss how difficulties of heating
caused by eddy currents in transformers may be overcome
139
Act. 7.3
139
gather and analyse secondary information to discuss the need for transformers in
the transfer of electrical energy from a power station to its point of use
145
Act. 7.3
142
136,
144
147
5. Motors are used in industries and the home usually to convert electrical energy into more useful forms
of energy
Students learn to:
Page
Students:
Page
124
Act. 6.3
gather, process and analyse information to identify some of the energy transfers
124,
and transformations involving the conversion of electrical energy into more useful 153
forms in the home and industry
Act. 7.3
Students:
Page
Page
Act. 8.1
Act. 8.2
Act. 8.2
162,
164
157
164
161
xiv
Stage 6
Physics syllabus grid
describe quantitatively the force acting 164
on a charge moving through a magnetic
field:
F = qvBsin
discuss qualitatively the electric field
strength due to a point charge, positive
and negative charges and oppositely
charged parallel plates
160
165
167
2. The reconceptualisation of the model of light led to an understanding of the photoelectric effect and
black body radiation
Students learn to:
Page
Students:
Page
182
Act. 9.1
identify data sources, gather, process and analyse information and use available
evidence to assess Einsteins contribution to quantum theory and its relation to
black body radiation
Act. 9.2
179
identify data sources, gather, process and present information to summarise the
use of the photoelectric effect in photocells
184
Act. 9.3
179
181
Act. 9.3
179
Act. 9.4
xv
Stage 6
Physics syllabus grid
3. Limitations of past technologies and increased research into the structure of the atom resulted
in the invention of transistors
Students learn to:
Page
Students:
Page
189
Act.
10.1
189
Act.
10.2
identify data sources, gather, process, analyse information and use available
evidence to assess the impact of the invention of transistors on society with
particular reference to their use in microchips and microprocessors
Act.
10.2
190
identify data sources, gather, process and present information to summarise the
effect of light on semiconductors in solar cells
Act.
10.3
199
193
193
199
4. Investigations into the electrical properties of particular metals at different temperatures led to the
identification of superconductivity and the exploration of possible applications
Students learn to:
Page
Students:
Page
process information to identify some of the metals, metal alloys and compounds 211
that have been identified as exhibiting the property of superconductivity and their
critical temperatures
209
Act.
11.1
209
Act.
11.1
209
identify that resistance in metals is
increased by the presence of impurities
and scattering of electrons by lattice
vibrations
gather and process information to describe how superconductors and the effects
of magnetic fields have been applied to develop a maglev train
Act.
11.1
215
217
xvi
Stage 6
Physics syllabus grid
Page
Students:
Page
230,
244
Act.
12.1
236
236
Act.
12.1
236
ni
nf
233,
245
Act.
12.1
231
analyse secondary information to identify the difficulties with the RutherfordBohr model, including its inability to completely explain:
the spectra of larger atoms
the relative intensity of spectral lines
the existence of hyperfine spectral lines
the Zeeman effect
Act.
12.2
n
n
f
i
237,
244
239
Page
Students:
Page
250,
259
249,
258
250,
257
gather, process, analyse and present information and use available evidence to
assess the contributions made by Heisenberg and Pauli to the development of
atomic theory
255
Act.
13.1
253,
257
xvii
Stage 6
Physics syllabus grid
3. The work of Chadwick and Fermi in producing artificial transmutations led to practical applications
of nuclear physics
Students learn to:
Students:
Page
Page
Act.
14.1
261,
275
solve problems and analyse information to calculate the mass defect and energy
released in natural transmutation and fission reactions
267,
277
263
263
269
266,
276
261
262
267
270,
275
271,
275
4. An understanding of the nucleus has led to large science projects and many applications
Students learn to:
Page
Students:
Page
280,
298
280
Act.
15.1
283,
298
identify data sources, and gather, process, and analyse information to describe
the use of:
a named isotope in medicine
a named isotope in agriculture
a named isotope in engineering
284,
Act.
15.2
xviii
292,
298
Stage 6
Physics syllabus grid
Page
Students:
Page
305
312
Act.
16.1
310,
311
identify data sources and gather information to observe the flow of blood through
the heart from a Doppler ultrasound video image
Act.
16.2
311
identify data sources, gather, process and analyse information to describe how
ultrasound is used to measure bone density
315
Act.
16.3
310
310,
311
I r Z2 Z 1
=
Io Z + Z 2
2
1
I r Z2 Z 1
=
Io Z + Z 2
2
1
310
312
315
Page
Students:
Page
321
Act.
17.1
322
gather secondary information to observe a CAT scan image and compare the
information provided by CAT scans to that provided by an X-ray image for the
same body part
Act.
17.1
326
Act.
18.1
329
Act.
18.1
334
336
337
xix
Stage 6
Physics syllabus grid
3. Radioactivity can be used as a diagnostic tool
Students learn to:
Page
Students:
Page
340,
343,
344
Act.
19.1
344
gather and process secondary information to compare a scanned image of at least Act.
one healthy body part or organ with a scanned image of its diseased counterpart 19.2
342
4. The magnetic field produced by nuclear particles can be used as a diagnostic tool
Students learn to:
Students:
Page
Page
Act.
20.1
identify data sources, gather, process and present information using available
evidence to explain why MRI scans can be used to:
detect cancerous tissues
identify areas of high blood flow
distinguish between grey and white matter in the brain
Act.
20.1
355
355
identify data sources, gather and process information to compare the advantages
and disadvantages of X-rays, CAT scans, PET scans and MRI scans
Act.
20.2
356
gather, analyse information and use available evidence to assess the impact of
medical applications of physics on society
Act.
20.3
357
xx
Stage 6
Physics syllabus grid
Module 6 Astrophysics
1. Our understanding of celestial objects depends upon observations made from Earth or from space
near the Earth
Students learn to:
Page
373
375
373,
378
Students:
Page
identify data sources, plan, choose equipment or resources for, and perform an
investigation to demonstrate why it is desirable for telescopes to have a large
diameter objective lens or mirror in terms of both sensitivity and resolution
377
Act.
21.2
2. Careful measurement of a celestial objects position in the sky (astrometry) may be used to determine
its distance
Students learn to:
Page
Students:
Page
388
solve problems and analyse information to calculate the distance to a star given
its trigonometric parallax using:
1
d =
p
Act.
22.1
388
Act.
22.2
389
Page
Students:
Page
390
Act.
22.3
390
analyse information to predict the surface temperature of a star from its intensity/ Act.
wavelength graph
22.4
393
395
393
xxi
Stage 6
Physics syllabus grid
4. Photometric measurements can be used for determining distance and comparing objects
Students learn to:
Page
Students:
Page
398
400
M = m 5 log
and
IA
= 100 (m
IB
d
10
mA)/5
399
Act.
22.5
401
identify data sources, gather, process and present information to assess the
impact of improvements in measurement technologies on our understanding of
celestial objects
Act.
22.6
401
397
5. The study of binary and variable stars reveals vital information about stars
Students learn to:
Page
Students:
Page
411
Act.
23.1
408
420
xxii
416
Stage 6
Physics syllabus grid
6. Stars evolve and eventually die
Students learn to:
Page
Students:
Page
423
Act.
24.1
428
analyse information from an HR diagram and use available evidence to determine 437
the characteristics of a star and its evolutionary stage
425,
430
425,
430
433
429,
431
437
xxiii
1
Context
Space
Modern physics was born twice. The first time (arguably) was in the 17th century
when Newton used his three laws of motion and his law of universal gravitation to
connect Galileos equations of motion with Keplers laws of planetary motion. Then
early in the 20th century, when many thought physics had almost finished the job of
explaining the universe, it was unexpectedly born again. Einstein, in trying to
understand the nature of light, proposed the special and general theories of
relativity (and simultaneously helped launch quantum mechanics).
Space was the common threadKepler, Galileo, Newton and Einstein were all
trying to understand the motion of objects (or light) through space.
Newtons laws of mechanics and his theory of gravitation led to space
exploration and artificial satellites for communication, navigation and monitoring of
the Earths land, oceans and atmosphere. Einsteins theory of relativity showed that
mass and energy are connected, and that length, mass and even space and time
are rubbery. Relativity has come to underlie most new areas of physics developed
since then, including cosmology, astrophysics, radioactivity, particle physics,
quantum electrodynamics, anything involving very precise measurements of time
and the brain-bending string theory.
So, whenever you use the global positioning system (GPS), consult Google
maps, check the weather report or make an international call on your mobile phone,
remember that the technology involved can be traced directly back to physics that
started 400 years ago.
Inquiry activity
Go ballistic!
The path through the air of an object subject only to gravity and air resistance,
is called a ballistic trajectory. If the object is compact and its speed is low, then
air resistance is negligible and its trajectory is a parabola.
Investigate parabolic trajectories using a tennis ball, an A4 piece of paper,
a whiteboard or a blackboard and a digital camera.
1 On a board about 2m wide, draw an accurate grid of horizontal and vertical lines
10cm apart.
2 With a firmly mounted camera, take a movie of a tennis ball thrown slowly in
front of the board. Try different angles and speeds to get eight or more frames
with the ball on screen, and get as much of a clear parabolic shape (including
the point of maximum height) as you can.
3 Using video-editing software, view the best movie, frame by frame, on a
computer. If your software allows it, create a single composite image with all
the balls positions shown on one image, to show the parabolic trajectory.
4 If you cant do that, then for each frame, on the board, and using the grid,
estimate the x- and y-coordinates of the balls centre to the nearest 5cm
or better. Some video software allows you to read the x- and y-coordinates
(in pixels) by clicking on the image.
5 Plot a graph of x versus y to produce a graph of the parabolic trajectory. The graph
might be a bit irregular because of random error in reading the blackboard scale.
6 Video the trajectory of a loosely crumpled-up piece of A4 paper. Now air
resistance is NOT negligible. Does the trajectory still look like an ideal parabola?
Cannonballs,
apples, planets
and gravity
What goes up must come down
space
Vertical displacement
Components of a trajectory
The ideal parabolic trajectory is an approximation that works under two
conditions:
1 Air resistance is negligible (gravity is the only external force).
2 The height and range (horizontal displacement) of the motion are both
small enough that we can ignore the curvature of the Earth.
a
Horizontal displacement
Figure 1.1.3 Trajectory of the riders projectile as seen by (a) the rider and (b) an observer on the ground
5
Cannonballs,
apples, planets
and gravity
The first condition is true for compact and low-speed projectiles. The second
is true in almost all human-scale situations, typically at or near the Earths surface.
Lets analyse an example of ideal projectile motion. Recall that the acceleration
due to gravity is g=9.8ms2 (see in2 Physics @ Preliminary section 1.3). Here we
are going to write it as a vector g. Clearly its direction is downwards.
Consider the trajectory of a ball. We start by separating the horizontal and
While the ball is in the air, the only
vertical components of its motion.
external force on it is gravity acting downwards, so there is a constant vertical
acceleration ay=g, illustrated by the changing vertical spacing of projectile
positions plotted at equal time intervals in Figure 1.1.2.
The net horizontal force is zero, so, consistent with Newtons first law,
horizontal velocity is constant (ax=0), which is clear from the equal horizontal
spacing of the projectile positions plotted at equal time intervals in Figure 1.1.2.
We can recycle the kinematics (SUVAT) equations from the Preliminary
course. (See in2 Physics @ Preliminary section 1.3.)
(4)
s = vt
(1)
s = ut +1 at2
2
u+v
s=
t
(2)
(5)
v2 = u2 + 2as
2
v = u + at
(3)
PRACTICAL
EXPERIENCES
Activity 1.1
Table 1.1.1
Horizontal components
Vertical components
ux = u cos i
uy = usini
vx = ux
vy = uy + gt
x = uxt
1 2
gt
y = uyt +
vx2 = ux2
75
90
60
45
Here we need to apply them separately to the vertical (y) and horizontal (x)
components of motion. Instead of displacement s, well use x=xfxi for
horizontal displacement and y=yfyi for vertical displacement. Well put
subscripts on the initial and final vertical velocities (uy and vy for example). We
only need to use SUVAT equations 3, 4 and 5. i is the launch angle (between
Remember to adjust the sign
the initial velocity u and the horizontal axis).
of g to be consistent with your sign convention. In problems involving gravity, up
is normally taken as positive, making the vector g negative (i.e. g = 9.8 m s2).
In the syllabus, vx2 = ux2 is included for completeness; but is unnecessary,
as it can be derived from vx=ux.
Some properties of ideal parabolic trajectories are:
At the maximum height of the parabola, vertical velocity vy=0.
The trajectory is horizontally symmetrical about the maximum height position.
The projectile takes the same time to rise to the maximum height as it takes to
fall back down to its original height.
For horizontal ground, initial speed = final speed.
Maximum possible height occurs for a 90 launch angle. The maximum
possible range (for horizontal ground) occurs for a 45 launch angle
(Figure 1.1.4).
Independent of their initial velocity, all objects projected horizontally from the
same height have the same time of flight as one dropped from rest
from the same height, because they all have a zero initial vertical velocity
(Figure 1.1.5).
30
15
space
100mm
Ballistics is a drag
Target practice
You now have all the equations you need to do
some damage, so lets launch some projectiles.
Safety warning! The following worked example
may seem dangerously long because it illustrates
several alternative methods of solving projectile
problems rolled into one.
Worked example
Question
You throw a ball into the air (Figure 1.1.7). You release the ball 1.50m above the ground,
with a speed of 15.0ms1, 30.0 above horizontal. The ball eventually hits the ground.
Answer the following questions, assuming air resistance is negligible.
a For how long is the ball in the air before it hits the ground (time of flight)?
b What is the balls maximum height?
c What is the balls horizontal range?
d With what velocity does the ball hit the ground?
1.50 m
Cannonballs,
apples, planets
and gravity
Solution
Always draw a diagram! Divide the motion into vertical (y) and horizontal (x) components.
Choose the origin to be the point of release, so xi=yi=0.
This is not always the most convenient choice of origin.
al
yu
ial
ini
i = 30.0
it
loc
ve
uy
ve
vy
loc
ity
ux
vx
Find final vertical velocity: vy2 = uy2 + 2gy = 7.502 + 29.801.50 = 85.65
vy =
1
Alternative method using the quadratic formula y = uyt + gt2 = 1.50m
2
1
2
Substitute, rearrange: 1.50 + 7.50t + 9.80t = 0
2
7.5 7.502 + 4 4.90 1.50
= 0.179s or +1.71s
2 4.90
Alternative method
1
Use vy = uy + gt to find the time t when vy = 0, then use y = uyt + gt2 to find
2
vertical displacement.
space
d x-component of final velocity: vx = +13.0ms1
This is the same as for the previous method within the three-figure precision of the
calculation, but doesnt tell us the direction.
Worked example
Question
Suppose you kick a ball at 22.0ms1, 20.0 above the horizontal, towards a wall 21.0m
away (Figure 1.1.9). Ignore air resistance and the balls radius.
a What is the balls time of flight (before hitting the wall)?
b At what height does the ball hit the wall?
c Is that the greatest height reached by the ball?
Solution
Choose the origin to be the initial position, so xi=yi=0. Use the sign convention +
and +.
ux= 22.0cos20.0 (right) = +20.7ms1
uy= 22.0sin20.0 (up) = +7.52ms1
9
Cannonballs,
apples, planets
and gravity
a The ball hits the wall when the horizontal displacement x=+21.0m.
= 1.014s 1.01s
+20.7 m s1
1
b The ball hits the wall at a height (vertical displacement) of y = uyt + gt2.
2
Rearrange, solve: t =
c Check if the ball reaches maximum height of the parabola before hitting the wall.
Rearrange, solve: t =
7.52
= 0.767s which is less than time of flight
9.80
The ball would reach the maximum height of the parabola before hitting the wall,
therefore the final height is NOT the maximum height for the trajectory.
Checkpoint 1.1
1
2
3
4
5
6
Determine the horizontal acceleration of a projectile in flight. Determine its vertical acceleration. (Assume
negligible air resistance.)
What angle of launch gives maximum horizontal range? What angle of launch gives the maximum possible height?
(Assume negligible air resistance.)
What is another name for air resistance?
If you throw a ball horizontally from the roof, and drop another at the same time, which one will hit the ground first?
Describe the two conditions that must apply so that a trajectory is a parabola.
List the 8 equations used in calculations of projectile motion. Explain why at least one of them is unnecessary.
1.2 Gravity
In Ptolemys universe, the Sun, Moon and planets each had a separate clockworklike mechanism to keep it in motion. Copernicus and Kepler greatly improved
the picture, but Isaac Newton finally showed there was a single mechanism for
them allthe force of gravity.
The calculations of parabolic trajectories in section 1.1 work well close to the
Earths surface where g is constant. However, if were going to venture out into
space, we cant use these simple equations. We need to look at the force of gravity
on a larger scale.
space
Like light intensity, the magnitude of the force decreases with distance
according to the inverse square law (see in2 Physics @ Preliminary sections
6.1 and 15.1). However, astronomer Ismael Boulliau had suggested this
before him.
The law of gravitation is universalit applies throughout the universe and is
responsible for the orbits of all the planets and moons.
All this is expressed mathematically as the law of universal gravitation:
FG = G
m1m2
d2
Try this!
Slightly attractive
Worked example
Question
Calculate the gravitational force between the Earth and the Moon.
Data: Earths mass mE = 5.971024kg
Solution
FG = G
mEmM
dEM 2
= 1.981020 N
11
Cannonballs,
apples, planets
and gravity
Now lets try an example with more than two masses.
Worked example
Moon
Question
Earth
spacecraft
A 1000kg spacecraft is in the vicinity of the EarthMoon system. The spacecraft is at the
origin, the Moon is on the positive yaxis and the Earth is on the positive xaxis (Figure
1.2.2). Given that the Earthspacecraft and Moonspacecraft distances are 3.82108m
and 3.91107m respectively, calculate the resultant gravitational force on the
spacecraft.
Data: Earths mass mE = 5.971024kg
Solution
Force due to Moon: FSM = G
FSM
Fres
spacecraft
d SM 2
mSmM
mSmE
d SE 2
(3.91 107 )2
PHYSICS FEATURE
1. The history of
physics
3. Applications and
uses of physics
space
PRACTICAL
EXPERIENCES
Activity 1.2
Boredom part 2
t is said that, at age 17, Galileo was attending church and, bored,
was watching a lantern swing from the ceiling. Using his pulse as a
stopwatch, he observed that the oscillation period of a pendulum barely
changed as its amplitude gradually decreased. Back at home he started
experiments confirming that the oscillation period depends on pendulum
length L, but not at all on mass and only slightly on amplitude. He
proposed (correctly) that pendulums could be used to create the first
accurate mechanical clocks.
We now know that, consistent with Galileos observations, for a simple
mass-on-string pendulum the formula for oscillation period T is:
T = 2
L
g
Cannonballs,
apples, planets
and gravity
Divide both sides by test mass m:
g=
FG
M
=G 2
m
d
Worked example
Question
Figure 1.2.6 Gravitational field lines around the
Earth (a) on an astronomical scale
and (b) near the surface
Solution
gE =
GM E
d2
gE=
= 9.81 ms2
PRACTICAL
EXPERIENCES
Activity 1.3
14
space
nte
ractiv
measurements that gets more severe as one approaches the equator. Because of the
Earths rotation, the (downward) centripetal acceleration (see
in2 Physics @ Preliminary section2.3) of the ground appears to be subtracted from
the true value of g. In fact this centripetal effect is responsible for the formation
of the equatorial bulge, which was predicted by Newton before it was measured.
The Sun and Moon also exert a weak gravitational force on objects at the
Earths surface, so the magnitude and direction of g vary slightly, depending on
the positions of the Sun and Moon. Variation in g caused by the positions of the
Sun and Moon relative to the oceans is responsible for the pattern of tides.
Strictly speaking, Newtons gravitation equation written in the form
above assumes that the planet is a perfectly uniform sphere. Close to the surface
of a planet, local deviations from uniform density can result in small local changes
in the magnitude and direction of g. The magnitude will be slightly larger than
average when measured on the ground above rock (such as iron ore) of high
density (mass per unit volume) and lower above rock containing low-density
minerals (such as salt or oil), an effect exploited by geologists in mineral
exploration. The Earths crust is less dense than the mantle, so variations in
thickness of the crust also affect g. Variation in g is measured using a gravimeter,
the simplest kind being an accurately known mass suspended from a sensitive
spring balance.
Variations in g on larger distance scales around the Earth can be measured
using satellites orbiting in low Earth orbit. Deviations in the orbital speed of
satellites indicate that, in addition to the equatorial bulge, Earth is also slightly
pear-shapedpointier at the North Pole than the South Pole.
M o d u le
Hookes law
haviour of springs
notes on the be
Figure 1.2.7 Hookes
15
Cannonballs,
apples, planets
and gravity
Checkpoint 1.2
1
2
3
4
5
6
For clarity well use the symbol EP instead of U to denote gravitational potential
energy calculated using the more accurate formula, even though the two symbols
are really interchangeable. Potential energy is energy stored by doing work
against any force (such as gravity) that depends only on position; therefore,
gravitational potential energy EP is energy stored by doing work against the force
of gravity. It can be shown (using calculus to derive the work done against
gravity by changing the separation of two masses) that:
E P = G
m1m2
r
space
+GmtmE
r E2
FG
rE
2rE
3rE
4rE
5rE
separation
GmtmE
rE
EP
Figure 1.3.1 Plots of gravitational force (FG) and gravitational potential energy (EP) versus separation
between a test mass mt and the Earth mE, starting at one Earth radius rE. The vertical FG
and EP axes are not drawn to the same scale.
Worked example
Question
A piece of space junk of mass mJ drops from rest from a position of 30000km from the
Earths centre. Calculate the final speed vf it attains when it reaches a height of 1000km
above the Earths surface. Assume that above 1000km, air resistance is negligible.
Data: Earths mass mE = 5.971024kg
Solution
Air resistance is negligible, so total mechanical energy (kinetic + potential energy) is
conserved. Assume that because of the enormous mass of the Earth, its change in velocity
is negligible. Use the Earth as the frame of reference. Dont forget to convert to SI units.
Cancel mJ:
Substitute:
K i + EPi = Kf + EP
m
m
mm
1
1
mJvi2 G J E = mJvf2 G J E
ri
rf
2
2
0 6.67 1011
5.97 1024
1 2
5.97 1024
11
=
6.67
10
v
f
30.0 106 2
(6.37 + 1.00) 106
106 106
7.37 30.0
= 9030 m s1 = 9.03 km s1
Note that this result doesnt depend on mJ.
17
Cannonballs,
apples, planets
and gravity
Isaac Newton showed that what goes up doesnt necessarily come down. Normally,
if one fires a projectile straight up, the object will decelerate until its velocity
changes sign and it falls back down. However, if a projectiles initial velocity is
high enough, the 1/d2 term in the gravity equation will cause the acceleration g to
decrease with height too rapidly to bring the projectile to a stop so it will never
turn backit can escape the planets gravitational field. The minimum velocity
that allows this is called the escape velocity. Strictly speaking, its really a speed,
because the initial direction of the projectile isnt critical.
Newton treated the projectile as a cannonball (with no thrust) so that, other
than the initial impulse from the cannon, the only force acting on it is gravity.
He conceived escape velocity using his force equation, and the escape velocity
formula can be derived from it. However, a more modern derivation using energy
is easier and similar to the previous worked example.
Let m be the mass of a projectile, M the mass of a planet, ve the initial speed
and r the initial position (the planets radius if you are on the surface). Assume air
resistance is negligible, so total mechanical energy (KE + GPE) is conserved (see
in2 Physics @ Preliminary section 4.2).
Ki + EPi = Kf + EPf
The escape velocity represents the minimum limiting case where the projectile
just reaches infinite displacement with zero speed; in other words, Kf=EPf=0.
1
GmM
mve 2
=0+0
2
r
Rearrange, cancel m:
ve =
Explain the concept of escape
velocity in terms of the:
gravitational constant
mass and radius of the planet.
2GM
r
If the initial speed is greater than this, the projectile will maintain a non-zero
speed even as it approaches infinite displacement. Note that the escape velocity
depends only on the planets mass and the projectiles starting position r but not
on the projectiles mass.
You may be puzzled that in the above derivation, the total mechanical energy
(sum of KE and GPE) was exactly zero. This means that the escaping projectile
has just enough (positive) KE to overcome its negative potential energy. When
the mechanical energy is less than zero, there is not enough KE to overcome the
GPE and the two masses are said to be gravitationally bound. When the total
mechanical energy ME>0, the KE can overcome the GPE and the two bodies
are no longer bound together. This concept of binding also applies to the other
three fundamental forces (including electromagnetism, which binds electrons to
the nucleus of an atom).
The escape velocity from the Earths surface is:
2 6.67 10 11 5.97 1024
6.37 10
18
= 11 200 m s 1 = 11.2 km s 1
space
This idealised escape velocity needs to be modified when applied to real
spacecraft. First, the derivation ignores air resistance in the atmosphere
(hundreds of kilometres thick), which would increase the escape velocity.
Second, in a real rocket, engines produce an extra forcethrustthat can
accelerate a craft to a higher altitude where the escape velocity is lower. It also
ignores other sources of gravitational fields such as the Sun, Moon and planets.
The escape velocity for a projectile under the gravitational influence of more
than one body is given by:
ve total = ve12 + ve22 +
where ve total is the escape velocity for the total system and ve1, ve2 are the
escape velocities from the individual bodies within the system, calculated for the
projectile using the same starting position in space.
Ultimate frisbee
as the first artificial object to leave the solar system a giant steel
frisbee? In the 1950s, the US started testing nuclear bombs
underground, to minimise atmospheric nuclear fallout. In 1957, during
Operation Plumbbob in the Pascal-B test, a nuclear bomb was detonated at
the bottom of a 150m shaft sealed with concrete and a 900kg, 10cm thick
steel cap. The steel cap fired upwards at enormous speed and was never
seen again. Before the test, it was estimated that an extreme upper limit for
the speed of the steel cap would be 67 kms1. This is well above the
escape velocity for the whole solar system (43.6 kms1 from Earth), starting
an urban myth that it beat the Voyager probes (launched in 1977) out of the
solar system. A later, more realistic, estimate suggested that, at most, the
cap had a speed of 1.4 kms1, reaching an altitude of less than 95 km.
Checkpoint 1.3
1
2
3
4
5
Define under what circumstances it is suitable to use the simplified formula U = mgh for gravitational potential
energy (GPE).
Write down the more accurate formula for GPE.
What limit does GPE approach as the separation of the two masses approaches infinity?
On what factors does Newtons idealised escape velocity depend?
What other factors affect escape velocity in realistic situations?
19
Cannonballs,
apples, planets
and gravity
PRACTICAL EXPERIENCES
CHAPTER 1
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
A ball is rolled down a ramp, whose dimensions will be known to you. Predict where
the ball will land.
Equipment: aluminium track, ball bearing, metre ruler, measuring tape, shoe.
ball bearing
track
ruler
Discussion questions
1 List assumptions you have made in order to make an estimate of the range.
2 Assess how reliable is your method.
3 Explain how changing the original angle of the ramp will affect the range
of the ball.
retort stand
string
mass
20
space
Discussion questions
1 Explain what you did in order to make the experiment reliable.
2 Galileo originally thought that the period of the pendulum did not depend
at all on the amplitude of the swing. Is this true? Explain how you can take
this into account in your experiment.
3 How does your value compare with the accepted value?
4 Outline another method that would allow you to achieve the same aim.
21
Cannonballs,
apples, planets
and gravity
Chapter summary
ve =
2GM
r
Review questions
Physically speaking
Complete each definition by using a keyword taken from the list at the beginning of the chapter.
To approach infinite distance from a massive central body, a projectile must start with _________________.
The path of a projectile is known as a _________________.
The formula for converting velocities between frames of reference is the _________________.
A projectiles maximum horizontal displacement is its _________________.
Universal gravitation and the intensity of light both follow the _________________.
Close to the Earths surface and subject only to gravity, a projectiles path is a _________________.
The acceleration of a _________________ near the central body equals the gravitational field.
Close to the Earths surface, all objects projected horizontally from the same height have the same_________________.
A _________________ is apparatus used to assist in mineral exploration.
If drag is negligible, then a projectiles range is determined only by initial velocity and _________________.
22
space
Reviewing
1 Given that the Earth rotates, account for why when you jump straight up,
you land on the same spot.
2 The high jump and the long jump both involve a run-up and then a jump.
Using ideas from projectile motion, briefly compare and contrast the ideal
characteristics of the run-up and jump for the two sports.
3 A projectile takes 1.25s to reach its maximum height. What is its time of
6 Describe how (and explain why) g would differ slightly from average at a point
on the Earths surface above an oil deposit.
7 Youve seen diagrams of electrical field lines around positive charges in which
the arrows point outwards (see in2 Physics @ Preliminary section 10.6).
Briefly discuss the possibility of a planet with gravitational field lines that
point outwards. Propose how you would expect a test mass to behave there.
8 Without doing a calculation, deduce the speed at which a meteorite would hit
the Earths surface if it started from rest at a very large distance from the
Earth. Justify your answer. Ignore air resistance and gravity of other
astronomical bodies. (Hint: The value is one already calculated elsewhere in
this chapter.)
9 Read the definition of gravitational potential energy EP in section 1.3 page 16.
Explain why it is necessary to specify in the definition that the work is done
with no net change in speed. (Hint: What other form of energy is involved?)
Solving problems
10 Repeat the calculation in the worked example accompanying Figure 1.1.7,
assuming that the ball lands on the flat roof of a 2.5m high garage, instead
of the ground.
derive an expression (containing initial speed u and launch angle ) for the
time taken for a projectile near the Earths surface to reach its maximum
height. Then show that the time of flight for a projectile fired over horizontal
ground is given by:
2u
t=
sin
g
23
Cannonballs,
apples, planets
and gravity
13 A marble rolls horizontally off the edge of a 1.00m high table with a speed
of 3.00ms1. Calculate the speed with which it hits the ground, by:
a using the equations of projectile motion
b assuming the conservation of mechanical energy (using the simple
version of the equation for GPE).
15 Using your own mass, calculate the maximum force of gravity exerted by
18 Using the data and answer from Question 17, calculate the speed at which
you would need to project the 10kg object radially outwards from the
initial position so that it would just reach the final position, stop and fall
back to Earth. (You can ignore air resistance above an altitude of
~1000km.)
19
Extension
Solve problems and analyse
information to calculate the
actual velocity of a projectile
from its horizontal and vertical
components using:
v x2 = u x2
v = u + at
vy2 = uy2 + 2ayy
1
x = uxt; y = uyt + ayt2
2
Re
24
iew
Q uesti o
Show that the dart will hit the monkey. (Hint: Show that by the time the
dart reaches the horizontal position of the monkey, both the dart and
the monkey have the same vertical position. Assume that air resistance
is negligible.)
space
PHYSICS FOCUS
How to weigh the Earth
torsion wire
F m
Figure 1.4.4 (a) Schematic and (b) cutaway view of the apparatus
used by Cavendish to weigh the Earth
the definition =
mass
and the mean radius
volume
25
Explaining
and exploring
the solar system
Getting up there
How many times have you been told to stop dreaming and
be practical? For scientists and engineers, both dreams
and practical know-how were potent tools to turn the
understanding of the physics of gravity and motion into the
technology of space travel. Most of the important pioneers
of rocketry were inspired to pursue dreams of space travel
by reading Jules Vernes (18281905) story From the
Earth to the Moon, or the stories of HGWells
(18661946). But they also had a solid
grounding in physics and engineering.
For most of the history of rocketry, starting with the invention by the
Chinese of gunpowder (the first rocket fuel or propellant) sometime
between 300bc e and 850 c e , the technology was driven mainly by military
applications. The Chinese invented the first rockets or fire arrows (fireworks
tied to arrows). Some of the early milestones of this history are summarised
in Table2.1.1 in the Physics Feature Fire Arrows on page 29.
Only in the 20th century were civilian and scientific applications
of rocketry (space exploration, Earth monitoring and communications)
finally considered to be potentially as important as the military ones.
space
Here well concentrate on the important
rocket researchers of the 20th century, the period
in which the most rapid scientific advances took
place. Below is a list their most important
contributions.
PRACTICAL
EXPERIENCES
Activity 2.1
Explaining
and exploring
the solar system
28
space
Lagrange points. (These are five stable locations around pairs of orbiting
bodies such as Earth and Moon at which a test mass can remain indefinitely,
requiring little or no thrust.) He suggested that colonists would live on the
inner surface of these cylinders 3km in radius and 20km long. The cylinders
would spin, using centripetal force, to simulate gravity, and the inside would be
covered with Earth-like geography.
PHYSICS FEATURE
Fire arrows
he following table is a
very incomplete summary
of some of the highlights of
the 24-century long history
of rocketry.
At some time between these dates, the Chinese invent gunpowder and fireworks.
11501200
The Chinese develop the first rockets, fire arrows (fireworks tied to arrows), and
projectile weapons including grenades and cannons are used against invading
Mongols.
12001300
Invading Mongols bring Chinese rocket technology to Europe and the Arabian
Peninsula.
15291556
Conrad Haas (Austria) proposes the first designs for multi-staged rockets.
1687
~1730
German Colonel von Geissler manufactures rockets (up to 54 kg) for warfare.
1792, 1799
Sultan Tipu (India) uses iron-cased 1km range rockets against British troops.
18031806
Impressed by Tipu, Sir William Congreve (Britain) develops more accurate 3km
range rockets up to 136 kg, which were used successfully against Napoleons ships
and against the Americans in the war of 1812.
19th century
1821
18611865
1865
Science fiction writer Jules Verne (France) publishes From the Earth to the Moon.
1903
29
Explaining
and exploring
the solar system
This IS rocket
science!
Thrust
Tsiolkovsky called a rocket a reaction device. This is because the burning
propellant forms hot, high-pressure exhaust gas that is forced through the nozzle
at high exhaust speed ve. By Newtons third law (see in2 Physics @ Preliminary
section 3.5), the large force exerted on the exhaust gas results in a reaction (called
thrust) back on the rocket, pushing it forward. You can use momentum to
calculate the thrust. (See in2 Physics @ Preliminary Worked example, p70.)
Lets analyse the [rocket + propellant + exhaust] as our system. The forces
ejecting exhaust out of the nozzle are internal forces, so they cant change the
net momentum of the system; therefore, momentum gained by the ejected
exhaust (meve) must be cancelled by the [rocket + propellant] gaining equal
and opposite momentum. An increase in momentum (impulse) of the
[rocket + propellant] implies acceleration and, hence, a force (called thrust FT).
Suppose the speed of the exhaust gas ve is constant over a time period t.
The impulse J=FTt exerted on the [rocket + propellant] and the impulse
(meve) exerted on the exhaust gas are equal in magnitude:
J=FTt = (meve) = veme
Rearrange:
FT = v e
me
t
FTwhere
= ve
Worked example
Question
The thrust equation doesnt only apply to rockets. A fireman holding a hose was not prepared
when the water was turned on and was knocked over by the unexpected thrust. Water exited
the spout with a speed 39.0ms1 and with a flow rate 470Lmin1.
Calculate the force that knocked him over. (Water density is 1000 kg m3 = 1.00kgL1.)
Solution
Mass flow rate of water = 470Lmin11.00kgL1/60s = 7.833kgs1
me
Equation for thrust: FT = ve
= 39.0ms17.833kgs1 = 305N
t
space
vertically. More recent rockets such as the US Space Shuttle and the European
Ariane 5 use a combination of liquid- and solid-propelled stages stacked side-byside. Russian rockets, such as the Proton, use only liquid propellant. Of course,
multi-stage rockets are also more complicated and so have more ways to fail.
In the US, smaller satellites and military missiles are launched using simpler,
solid-fuel rocket engines.
fuel
solid fuel
oxidiser
mixture
g-force
oxidiser
Maybe youve heard that astronauts are squashed by g-force when a rocket
accelerates on take-off. Often the term g-force is used to quantify the effects on
pumps
your body of accelerations experienced in a roller-coaster, car or aeroplane. Its not
combustion
an accurate name, because its value is more closely related to acceleration than
chamber
force, but it is used extensively in aeronautics and astronautics, so well give you
combustion
a commonly used definition. The g refers to acceleration expressed in units of
chamber
g=9.8ms2. The force refers to the fact that a net external force is responsible for
nozzle
nozzle
that acceleration and it is this force and resulting reaction forces within a body that are
responsible for the effects of g-force. Sometimes the term g-load is used instead.
exhaust
exhaust
Lets start with the vertical component of motion and g-force. Consider three
Figure 2.1.10 Typical (a) solid-propellant
situations:
and (b) liquid-propellant
1 While you are sitting or standing still (not accelerating), there is no net force
rocket engines
on you. Your body is compressed by a pair of balanced forcesweight mg
downwards, and the upward normal force N from the seat or floor. (See in2
Physics @ Preliminary p45.) This compression causes the internal effects of
weight. Your body is experiencing the compressive effect of 1 unit of Earth
gravity (i.e. g-force = 1).
2 If you are accelerating upwards, such as riding the bottom of a curve on
a roller-coaster or in a rocket during take-off, the net force is upwards
the normal force from the seat is larger than your weight. Your body compresses
more than usual, as though you are heavier. If your net upward acceleration is
9.8ms2 (1g), then your body is compressed as though you are in a
gravitational field of 1+1=2 units of Earth gravity (i.e. g-force=2).
3 In free-fall (or in orbit), the normal force from the chair disappears and you
Identify why the term g forces
are no longer compressed. You feel effectively weightless (see in2 Physics @
is used to explain the forces
Preliminary pp3738), even though at typical Space Shuttle altitudes you
acting on an astronaut during
actually have ~90% of your weight on Earth. In this case you are experiencing
launch.
g-force=0, the same effect as 0 units of Earth gravity.
The term g-force usually means apparent weight divided by true weight
on Earth. Apparent weight is what appears on a set of bathroom scales. Bathroom
scales actually measure the magnitude of the normal force, not true weight, so,
to calculate g-force, first calculate normal force. Consider vertical motion only.
Weight mg is down and normal force N is up. Let av be vertical acceleration and
let up be positive:
Fnet = mav = N + (mg)
Apparent weight N:
N = mav + (+mg)
The apparent weight increase is caused by the increase in normal force N due
to the term mav, which is simply the net force accelerating you.
Vertical g-force = N mav + ( + mg)
=
mg
mg
a
Vertical g-force = gv + 1
31
Explaining
and exploring
the solar system
The First
Astronauts?
The agv term in the g-force formula represents the effects of your acceleration
due to the net force and the +1 represents the background effects of your real
weight.
Notice that in free-fall your acceleration is g, so the g-force is 1+1=0.
If you are holding a rope or are strapped into a harness being pulled upwards
instead of sitting or standing, then the above discussion and formulae still apply
except that now your body is being stretched and the normal force is replaced by
the tension force.
The horizontal component of g-force, such as when you accelerate or
decelerate in a car, is easier to calculate. (See in2 Physics @ Preliminary Physics
Phile gWhiz p11.) As there is no horizontal component of weight mg, the
g-force equation simplifies:
Horizontal g-force = agh
To calculate the resultant g-force, combine the vertical and horizontal
components of g-force using vector addition.
Worked example
Question
You round the bottom of an upturning curve on a roller-coaster at a speed of 36.0km h1.
The curve is circular with radius 5.00m. Calculate the g-force you would experience.
Solution
Analyse the forces involved in
uniform circular motion for
a range of objects, including
satellites orbiting the Earth.
At that moment, you experience (upward) centripetal acceleration (see in2 Physics @
Preliminary section 2.3). Calculate the vertical g-force.
36
v = 36 km h1 =
m s1 = 10 m s1
3.6
ac =
Vertical g-force =
v 2 102
=
= 20.0 m s2
R 5.00
av
+20.0
+1 =
+ 1 = 3.04
g
9.80
One effect of g-force is that the apparently increased weight of the blood
drains it from the head, affecting vision and consciousness. On average, 45 g
causes dimming of vision, 56 g visual blackout and above 6 g you experience
loss of consciousness (g-LOC). Much larger g-forces can be tolerated for periods
of less than about 4seconds.
To increase g-force tolerance during launch, astronauts face the direction of
acceleration. This orientation is called eyeballs in, because the eyeballs are
effectively pushed into their sockets. Also, the seats are oriented with the head
and body lying horizontally (Figure 2.1.11). In this way, g-force doesnt easily
force blood into or out of the head. Fighter pilots and astronauts also wear
gsuits containing inflatable bladders in the trousers which squeeze blood out of
the legs and back into the head.
Using a powerful cannon to launch a satellite (see Newtons thought
experiment in section 2.2) would not work because of the enormous g-force
from the initial explosion.
32
space
Warning! The terms g-force and g-load are not SI quantities. They are
informal terms and are sometimes used carelessly. Sometimes g-force and g-load
are used to mean the same thing. Sometimes g-load is used to mean only the net
acceleration in units of g, not including the effect of gravity. When reading
g-force or g-load data, be careful to check which definition is being used.
33
Explaining
and exploring
the solar system
external tank
separation
C solid rocket
E in orbit
D accelerating up
booster
separation
to orbital speed
3.5
3.0
B reduced
air resistance
g-force
2.5
lift-off
2.0
1.5
1.0
0.5
0
E
100
200
300
400
500
600
Time (s)
Earths rotation
Running start
It takes a lot of fuel to get a spacecraft to a high enough altitude and high
enough speed to achieve orbit. You can get higher if (like a pole vaulter) you get
a run-up before lift-off. Given that Earth rotates rapidly, a rocket already has a
large easterly tangential velocity at launch. So, if you launch towards the east,
you can use less propellant, carry a larger payload or go into a higher orbit. The
closer you are to the equator, the faster your initial speed u (Figure 2.1.13).
At the equator u=465ms1.
Worked example
Question
Compare Earths rotational tangential speed vT at the rocket launch facilities at Woomera,
Australia (used during the 60s and 70s), and Kourou, French Guiana.
v1
v2
tangential velocity
Data:
Kourou:
Solution
Tangential speed: v T =
space
vT =
2rE
2 6.37 106
cos lat =
cos lat = 465 cos lat
T
86164
direction of Sun
orbital
motion
Earth
Checkpoint 2.1
1
2
3
4
5
6
7
8
Which two people led the space race between the USSR and the US?
What event triggered the US government to fund a large civilian rocket program?
What are Lagrange points?
Name the two basic kinds of rocket engine.
Explain why a multi-stage rocket allows a heavier payload.
What is the vertical g-force on someone:
a standing stationary?
b in free-fall?
c in orbit?
Explain why the acceleration of a firing rocket increases with time.
Explain one reason why most rockets are launched towards an easterly direction.
35
Explaining
and exploring
the solar system
curving it downwards until impact. If you increased the projectiles initial
velocity, it would travel further around the Earth and its trajectory would be less
steeply curved. Eventually you would reach an initial velocity at which the
curvature of the trajectory exactly matched the curvature of the Earth itself, so
that the projectile would never catch up to the groundthe projectile was now
in a circular orbit. The projectile would become an artificial satellite (an object
in orbit around a much larger one) in the same way that the Moon is a natural
satellite of Earth. Newton also showed that if you increased the velocity further,
the orbit would become an ellipse. At high enough velocity (escape velocity), the
projectile would never return.
PRACTICAL
EXPERIENCES
Activity 2.2
Vi
B
Vf
R
R
Vf
Vi
36
space
Worked example
Question
Show that GME = ad2 is the same for a falling apple and the orbiting Moon. Assume that
the Moons orbit is circular.
Data:
Solution
Apple on Earth: GME = ad2 = 9.80ms2(6.37106m)2 = 3.981014m3s2
Moon in orbit: Centripetal acceleration ac =
v
r
2r
4 2r
a = 2
T
T
Orbit radius r = dEM; T=27.3243600 = 2.359106s
Orbital velocity v =
GME = ad2 =
4 2dEM 3
T
(2.359 10 s)
= 4.02 1014 m3 s 2
Extension: (Hard) Can you think of a reason for the ~1% discrepancy? (Hint: See Physics
Phile Finding new planets page40.)
37
Explaining
and exploring
the solar system
2 A line joining the planet to the Sun sweeps out equal areas in equal times. This
means the planet travels faster when it is closer to the Sun (Figure 2.2.3b).
3 For all planets orbiting the Sun, the square of the orbital period T is
proportional to the cube of the semimajor axis a. This is the law of periods.
T2
a3
constant
= constant
These laws hold for any orbiting system of two bodies if the mass of the
central body is very much larger than the mass of the other. They also apply to
circular orbits, since a circle is a special case of an ellipse.
If c is the half the distance between the foci of an ellipse, then eccentricity e is
defined as e = ac . A circle is an ellipse with zero eccentricity (e=0); both foci are
For a circle, the semimajor axis becomes the
together at the circle centre.
radius, a=r. Orbits of planets in our solar system are very nearly circular, the
two most eccentric being Mercury with e=0.2056 and Mars with e=0.0934.
Pluto has e=0.2482 but, sadly for Pluto-fans, it was demoted to a dwarf planet
in 2006.
a
Sun
semimajor axis
focus
perihelion
(closest to
the Sun)
focus
aphelion
(farthest from Sun)
focus
Area
Area
Figure 2.2.3 (a) A highly eccentric elliptical orbit. (b) Keplers law of areas: a line
joining the planet to the Sun sweeps out equal areas in equal times.
Newton derived Keplers laws from his laws of motion and gravitation. He
showed that Keplers first law (law of elliptical orbits) follows from the inverse
square law. By including his three laws of motion, he also proved Keplers second
law (law of equal areas). These derivations are beyond the syllabus; however,
showing Keplers third law, the law of periods, for a circular orbit is very easy.
Careful! Dont confuse the semimajor axis a with centripetal acceleration ac.
Suppose a satellite of mass m orbits a central body of mass M and m<<M
so that the acceleration of M is negligible. In a circular orbit, the satellites
acceleration is centripetal:
2 r
4 2r
v2
ac =
and v =
ac =
r
T
T2
where v is orbital velocity, r is orbital radius and T is orbital period.
The magnitude of gravitational force exerted on m is:
F = mac =
38
GmM
2
ac =
GM
r2
4 2r
T2
T2
r3
42
(which is constant)
GM
space
We derived this for a circular orbit, but it is also true for elliptical orbits if
we replace radius r with semimajor axis a.
T2
a3
42
GM
Because here the value of Keplers constant is explicit, we will call this the
explicit form of Keplers law of periods.
Worked example
r3
T
GM
4 2
Question
From Earth, you observe (almost edge on) the orbit of Jupiters moon Ganymede and
determine its orbital period to be T=7.15 Earth days. You measure the width of the orbit
to be w=2.14109m. Assuming the orbit is circular, determine Jupiters mass.
Solution
r=
M=
w
= 1.07 109 m
2
r 3 GM
=
T 2 4 2
4 2r 3
GT 2
4 2 (1.07 109 )3
6.67 1011 (7.15 24 3600)2
= 1.90 1027 kg
Uranus
Earth
Halleys
Comet
Neptune
Mars
Jupiter
Saturn
39
Explaining
and exploring
the solar system
large planet
centre of mass
star
Worked example
Question
Derive an expression for the magnitude of orbital velocity for a satellite in a circular orbit, in
terms of mass of the central body M and orbital radius r. Use this expression to calculate
the Moons orbital speed, assuming a circular orbit.
Data:
Solution
Gravitational acceleration:
ag =
GM
r2
v 2 GM
= 2
r
r
Rearrange:
v=
Moons orbital speed:
v=
GM
for circular orbits
r
= 1020 m s1
Note that orbital speed is independent of the mass of the satellite. One of the
consequences of this equation is that for circular orbits, the smaller the radius the
faster the orbital speed.
40
space
Checkpoint 2.2
1
2
3
4
5
6
7
8
9
10
11
central
body
hyperbolic
Isaac Newton showed that all orbits consistent with the Law of
Universal Gravitation and his laws of motion fall into one of four possible
circular
types: the first two suggested by Keplers first lawcircles and ellipses
parabolic
as well as parabolas and hyperbolas. (Remember that a hyperbola is the
curve you get when you graph y = xk ). The central body (if very massive)
would be at the focus of all these curves.
Consider Figure2.3.1 (and Figure 2.2.1). Suppose we increase the
elliptical
speed of the satellite executing the circular orbit. The orbit will become
Figure2.3.1 Gravity allows four possible kinds of orbits.
elliptical. If we further increase the speed so that we reach escape velocity,
the satellite wont return. (See Escape velocity: What goes up ? p 18).
The orbit is a parabola. If we now increase the speed beyond escape velocity,
the orbit becomes a hyperbola.
All these orbits are symmetrical in shape and speedthe speed of
the satellite at any position is identical to its speed at a mirror image position
(although obviously the direction is different).
Circles and ellipses are called closed or stable orbits because the satellite
follows a closed curve, repeating its motion periodically and indefinitely if
undisturbed. However, an object approaching a much larger mass doesnt always
fall into a stable orbit around it. Assuming that it doesnt hit the surface of the
larger mass, an object moving fast enough will execute an open orbita parabola
or a hyperbolaflying off after the encounter, never to return.
The kind of orbit depends on the sign of the mechanical energy ME:
ME = K + E P =
1 2 GmM
mv
2
r
where m is the mass of the satellite and M is the central mass. Here, r is the
instantaneous distance from the central mass. For a circular orbit, r is constant
(= the orbital radius), but r varies with time for an elliptical orbit. In the absence
41
Explaining
and exploring
the solar system
Try this!
Plasticine orbital mechanics
Isaac Newton showed that all possible orbits
are examples of conic sections, curves
resulting from taking planar slices of a cone at
different angles. Make an accurate cone out of
plasticine or foam and use a sharp knife to cut
sections to reveal orbital shapes:
Cut horizontally = circle.
Cut parallel to one edge of the cone = parabola.
Cut less steeply than the parabola = ellipse.
Cut more steeply than the parabola = hyperbola.
circle
ellipse
hyperbola
Worked example
parabola
asymptotes to
hyperbola
Question
A comet of unknown mass mC passes the Sun (MS = 1.991030kg).
At perihelion, the centre-to-centre distance is r=5.081010m and
the speed of the comet is v=7.50104ms1.
What type of orbit is it? Do you expect the comet to return?
Solution
Gm M
GM
1
1
ME = mCv 2 C S = mC v 2 S
2
r
r
2
hyperbola
1
6.67 1011 1.99 1030
= mC (7.50 104)2
5.08 1010
2
= mC ( +2.0 108 )
ME > 0, so the orbit is hyperbolic and the comet is not expected to return
(unless later interaction with another body reduces its energy sufficiently).
(Careful! When subtracting two numbers of very similar size, check
that the answer is not smaller than the smallest significant figure in
the calculation.)
42
space
Strictly speaking, when calculating gravitational potential energy, we really
should include the effect of other astronomical bodies such as nearby stars.
However, as long as the other bodies are either too small or too far away, we can
get away with calculating the two-body ME using only the masses of the central
The rules relating ME to the shape of the orbit only
and orbiting body.
apply to the two-body ME.
43
Explaining
and exploring
the solar system
the equator, so that at any time from anywhere at least six satellites are
contactable. Arctic regions are poorly covered by geostationary satellites, so
Russia uses communications satellites in Molniya orbitshighly elliptical,
semi-synchronous orbits inclined to the equator, designed to dwell over Arctic
regions for many hours each day. Russia and the USA both use such orbits for
spy satellites.
Checkpoint 2.3
1
2
3
4
5
6
Name the four shapes of orbits allowed by Newtons law of universal gravitation.
Describe the orbit normally followed by a projectile passing (without capture) a massive body. Explain why it is
rarely a parabola.
List the three different conditions on the sign of two-body orbital mechanical energy, and name the orbits that result.
Outline why low Earth orbits are temporary.
Describe the Van Allen radiation belts.
Define the terms geostationary, geosynchronous and semi-synchronous.
44
Now you know how to get into orbit, but what are you going to do while youre
up there? To tour the solar system on very little propellant, you will need to learn
to hitchhike and steal!
When exploring the outer parts of the solar system with a space probe, you
can save fuel and increase payload space by hitching a ride with some of the
planets. Planets carry an enormous reserve of orbital kinetic energy (and
momentum). By steering a space probe into a temporary hyperbolic orbit around
a planet, you can steal some of its momentum using gravity assist (or the
slingshot effect).
Consider Figure2.4.1. A space probe that approaches a planet (for example
Jupiter) with an inbound speed of Vin, is steered into a hyperbolic orbit by the
Jupiters gravity and leaves in a different direction with an outbound speed Vout.
Judged from Jupiters frame of reference (Figure2.4.1a), Jupiter is the central
mass and, as orbits are symmetrical, the two speeds are equal. Using lower case
text for velocities measured in Jupiters reference frame, vout = vin.
To calculate velocities as seen by an observer stationary relative to the Sun,
we simply add Jupiters velocity (relative to the Sun) to each velocity in Jupiters
reference frame. If we do this, it turns out that Vout>Vin; the trip around Jupiter
has increased the speed of the probe (Figure2.4.1b). Momentum is conserved,
so the probes gain in momentum is exactly balanced by the planets loss of
momentum. However, because of the planets enormous mass, the decrease in
its speed is immeasurably small.
space
We can also treat gravity assist as a perfectly elastic collision (see in2 Physics
@ Preliminary p66). The planet pulls the probe rather than pushing and the
collision is gradual, but the conservation of momentum still applies. It is elastic
because gravity conserves mechanical energythere is no friction. The planet
is like an enormously heavy cricket bat and the probe is a small, highly elastic
superball. In the bats frame of reference, the bat is a stationary immovable
object (see in2 Physics @ Preliminary p66), so in a highly elastic collision the
balls speed before and after the collision is practically unchanged. In the frame
of reference of the batsman, however, the ball has been given an increase in speed
by the moving bat.
In 1973, Mariner 10 first used gravity assist (from Venus) to achieve
a flyby of Mercury to gather images and other measurements. The Galileo probe
was launched from a Space Shuttle in 1989, reached Jupiter in 1995 and
studied Jupiters moons until 2003. To reduce the explosion hazard to the
Shuttles astronauts during launch, Galileos fuel requirement was decreased
by using gravity assist, once from Venus and twice from Earth, to slingshot
Galileo to Jupiter. The reduction in Earths orbital velocity was of the order
of only 1018ms1.
spacecrafts
velocity outbound
vout = vin
spacecrafts
trajectory
Jupiter
spacecrafts
velocity
inbound
resultant Vout
Worked example
Jupiter
Question
For gravity assist, the maximum possible speed increase occurs for the (unrealistic) extreme
limit at which the spacecraft executes a nearly 180 turn, parallel to the planets orbital
motion (Figure2.4.2).
Jupiters velocity
relative to the Sun
resultant Vin
Show that for this special case, the change in speed of the craft in the Suns frame of
reference is twice the orbital speed of the planet Vp. You can assume that in the Suns
frame, the probes speed is always larger thanVp.
Solution
Be careful! Change in speed VoutVin is the change in the magnitude of the velocity, but
it is not the same as the magnitude of change in velocity |VoutVin|.
All labelled velocities in Figure2.4.2 lie in one dimension. Upper case variables denote
quantities in the Suns reference frame, and lower case variables the planets frame.
As usual, bold = vectors, italic = magnitudes and sign = direction.
Vin, Vout and Vp are respectively the spacecrafts incoming and outgoing speeds and
the planets orbital speed in the Suns frame of reference. Because of orbital symmetry, in
the planets frame the spacecrafts incoming and outgoing speeds vin and vout are equal.
Let them both equal v.
vout
vin
b
Vout
(1)
Equate (1) and (2):
Rearrange:
v = Vin + Vp
v = Vin + Vp = Vout Vp
Vp
Vin
45
Explaining
and exploring
the solar system
Voyager 1
Saturn
12 Nov 80
Jupiter
9 July 79
Saturn
26 Aug 81
Jupiter
5 Mar 79
Earth 5 Sept 77
20 Aug 77
Pluto
Aug 89
Uranus
27 Jan 86
Voyager 2
Neptune
01 Sept 89
Figure2.4.3 Voyager 1 and Voyager 2 take the grand tour of the solar
system using gravity assist.
Checkpoint 2.4
1
2
3
4
In a gravity-assist manoeuvre, a probe increases its momentum. Explain how momentum is conserved and why
this might not be obvious to an observer.
During a slingshot manoeuvre, what is the shape of the probes orbit in the planets reference frame?
What can we say about the probes speed (far from the planet), before and after the slingshot manoeuvre, when
viewed from the planets frame?
If VP is the orbital speed of the planet, what is the maximum possible speed increase for a probe executing
gravity assist?
Orbital decay
Account for the orbital decay
of satellites in low Earth orbit.
46
Craft in low Earth orbits do experience some drag or air resistance, because the
atmosphere gradually thins out with altitude in a very roughly exponential trend
(Figure2.5.1). At altitudes above ~1000km, drag is considered negligible. Most
Earth-sensing satellites orbit below this altitude, and so have a limited lifetime.
Drag converts orbital kinetic energy into thermal energy, causing the orbital
radius to decrease (orbital decay). The lower the orbit, the greater the air density
(and drag) and so the faster the orbital decay. Sustained orbits are not possible at
space
100
105
1010
1015
100
200
300
700
800
900 1000
Rest In Pieces
of Skylab
Figure2.5.2 The largest surviving fragment
47
Explaining
and exploring
the solar system
s
in
cie
ffi
nt d
rag
re-entry corridor
sive drag
c
ex es
Deceleration
As at launch, astronauts seats during re-entry are oriented perpendicular to, and
facing (eyeballs in) the direction of acceleration, but this time acceleration is
opposite to velocity, so they look backwards.
Traditional re-entry vehicles (such as were used in the 1960s and 1970s) were
teardrop-shaped capsules with the blunt end pointing forward (Figure2.5.4).
They allowed very little or no control once re-entry had begun and provided very
little lift. Such a re-entry is called ballistic re-entry and requires larger re-entry
angles. This kind of capsule subjected the astronauts to a maximum re-entry
g-force of anywhere between 6 and 12. The Apollo re-entry angle was between
5.2 and 7.2
The Space Shuttle introduced in 1981 has wings that provide lift and flightcontrol structures (such as elevons, a rudder/speed brake and a body flap) that
allow considerable control over the descent, adjusting the vehicles aerodynamics
to the changing density of the air, and making re-entry more gentle, with a
maximum g-force of about 23. This degree of control also widens the safe re-entry
corridor, allowing a gentle, low-g 12 re-entry. This is called glide re-entry.
To further decrease descent speed without excessive g-force, the Shuttle performs
a series of Sshaped turns by rolling and banking, gently enhancing drag.
The Russian Soyuz capsule, in use continuously (in modified form) since the
1960s, is a more spherical variation of the traditional capsule shape but with
attitude control thrusters, which provide some glide control during re-entry.
It usually yields a g-force of 45, but sometimes up to about 8 for a completely
ballistic re-entry.
Ballistic re-entries are high acceleration but quickbetween 10 and 15
minutes. A full glide re-entry is low acceleration but slowShuttle re-entry,
for example, takes about 45 minutes. Soyuz is intermediate and takes about
30 minutes.
Heating
On re-entry, vehicles travel at well above the speed of sound. The speed of sound
is sometimes called Mach1 (after Ernst Mach (18381916) a physicist and
philosopher who studied gas dynamics). Twice the speed of sound is called
Mach2 and so on. Supersonic means travelling faster than Mach1. Hypersonic
usually means faster than Mach5 (oversimplifying somewhat).
Pressure builds up in front of projectiles. Sudden pressure changes normally
propagate away as sound (at the speed of sound). In supersonic flight, however,
48
space
this pressure wave is too slow to move out of the projectiles way, so the pressure
builds up to very high levels, forming a shock wavethe air equivalent of the
bow wave in front of a speed boat.
The enormous mechanical energy of orbit must go somewhere. Drag
converts it to thermal energy. Contrary to common sense, in hypersonic flight,
a blunt projectile with more drag actually gets less hot than a more streamlined
one. In the 1950s, Harvey Julian Allen proved this theoretically and explained
why the sharp nose cones of intercontinental ballistic missiles were vaporising on
re-entry. Hypersonic wind tunnel tests (see Figure2.5.5) confirmed his theory.
In hypersonic flight, much of the heat generation takes place in the shock
wave (the dark line wrapping around the front of both projectiles in Figure2.5.5).
The shock wave does not touch the blunt projectile (Figure2.5.5a) and so
doesnt transfer the heat efficiently to it, but it does touch the tip of the sharp
projectile (Figure2.5.5b), which gets much hotter. For this reason, re-entry
vehicles (including the Space Shuttle) are blunt at the front, and hence the
traditional teardrop shape of capsules.
The blunt front of the vehicle is also coated with a suitable heat shield with
very high melting and vaporisation temperatures. It is also highly insulating to
slow the rate of thermal conduction. Thermal insulating materials in most
applications are almost always very porous because tiny pockets of gas are very
poor thermal conductors. Some insulator materials are also designed to be highly
light-absorbing (black) in the visible and near infra-red parts of the spectrum
because such surfaces, when hot, also radiate thermal energy away more
efficiently (radiative cooling).
Tiles on the Shuttle surface are made of 90% porous silica fibre, which is an
excellent high melting point insulator, but it is brittle. The tiles on the hottest
parts (the underside and leading edges) are also coated with a tough black glass
to enhance radiation of thermal energy, but also to provide mechanical strength.
Broken tiles were believed to be responsible for the destruction during re-entry
of the Shuttle Discovery in 2003.
In more traditional space capsule ballistic re-entry, drag is higher, so heat is
generated more rapidly, and insulation and radiation alone are not enough. In
these cases, the insulating heat shield is also designed to vaporise and erode
(ablation). The hot, vaporised and ablated material carries thermal energy away
rather than conduct it to the capsule, similar to the way in which evaporating
sweat carries away excess heat from your skin. The pressure from this ablation
also helps to push away the hot gas convecting from the shock wave. The shield
must be thick enough to last the journey and provide sufficient insulation.
Two modern examples of ablating materials are phenolic impregnated carbon
ablator (PICA) and silicone impregnated reusable ceramic ablator (SIRCA). In
the Chinese space program, one of the ablation materials used is blocks of oak
wood. Its cheap and easy to work. As it chars, it forms charcoal, which is porous
and almost pure carbon, making it an extraordinarily good thermal insulator
with a very high melting point. Another advantage is that porous carbon is very
black and radiates thermal energy efficiently. However, it is mechanically weaker
than more high-tech ablation materials.
During re-entry, superheated air surrounding the vehicle is ionised. The air
becomes a plasmaa conductive soup of free positive and negative charges that,
like the Earths ionosphere (see in2 Physics @ Preliminary pp1534), reflects
Explaining
and exploring
the solar system
a
Figure2.5.5 Hypersonic wind tunnel tests. (a) The crescent-shaped shock wave is detached from
the blunt projectile, but (b) touches the tip of the sharp projectile.
radio waves, so the astronauts cannot communicate with the Earth for several
minutes during re-entry. This problem has been solved for the Shuttle by
communicating via a satellite above it, since only the bottom of the Shuttle has
significant ionisation.
Landing
Drag depends on the projectiles cross-sectional area and speed. Drag cannot stop
a projectile completely because, during deceleration, drag decreases until it
exactly cancels the weight of the projectile and deceleration stopsthe projectile
has reached terminal speed (see in2 Physics @ Preliminary p45). The terminal
speed of a capsule is too high for it to land safely. To slow the capsule further for
the landing, drag is enhanced (and terminal speed decreased) by using parachutes
to increase the effective area of the capsule.
The final touchdown could be on land (typical of Russian missions) or a
splashdown in the water (typical of US missions pre-Shuttle). Russian Soyuz
also has soft-landing engines that fire just before it touches the ground.
The Space Shuttle lands on a runway, much like an aeroplane (Figure2.5.6)
but it uses parachutes to help it brake. During landing, the Shuttle (which has
been described as being like flying a brick with wings) is controlled entirely by
computer.
Another issue is accurate targeting of the landing site. The steeper the
re-entry angle, the smaller the horizontal component of motion (range) and so
the more accurate the prediction of the final landing site. However, the Shuttle
makes up for its shallow re-entry, because its aeroplane-like flight-control
structures allow adjustment of the landing path. The shape of the landing path is
also designed to be more forgiving. The Shuttle approaches the runway roughly
opposite to the landing direction. Fourminutes from touchdown, it does a
heading-alignment loop, to adjust precisely to the direction of the runway
(Figure 2.5.6).
50
space
Altitude 25 000 m
Mojave
Runw
ay 23
E
d
Airfor wards
ce Ba
se
Figure 2.5.6 Scale drawing of the relatively gentle descent of the Space Shuttle. The Shuttle is
drawn at 1minute intervals to touchdown. The squares on the ground are 10 nautical
miles (18.5km) wide.
Checkpoint 2.5
1
2
3
4
5
6
7
8
9
10
11
51
Explaining
and exploring
the solar system
PRACTICAL EXPERIENCES
CHAPTER 2
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Use the template provided in the activity manual to extract information about
your chosen scientist. Process this information to make a short oral presentation
to the class.
Discussion questions
1 For the scientist that you have researched, list their main contributions
to space exploration.
2 Explain how later scientists have benefited from this research.
Extension
3 Werner von Brauns great Russian rival, the Chief Designer for the USSR
space program Sergey Korolyov, is not as familiar as some of the names
mentioned in the syllabus, despite leading the launch of the first artificial
satellite, Sputnik, in 1957. This is probably because his name was kept
secret by the communist government of the USSR until after his death in
1966. You may also want to research his contribution to space exploration.
F =m
v2
r
Perform an experiment that will allow you to determine the relationship between
the radius of a satellites orbit around the Earth and its gravitational force.
Equipment: string, rubber stopper, mass carrier and masses, electronic scales,
glass or plastic tube, paperclip, sticky tape, metre ruler, stopwatch.
Discussion questions
1 From your experimental data,
determine the mathematical
relationship between the
orbital radius of a satellite and
its tangential velocity for a
given centripetal force.
2 Describe the method you
would use to determine the
centripetal force on a small
model satellite.
tension
glass or
plastic tube
paperclip
string
mass carrier
52
mg
Chapter summary
mv 2
(for circular orbits)
R
Keplers laws (apply to any two bodies if the central
body has a very much larger mass):
1 Orbits of planets are ellipses, with the Sun at one
focus.
2 Planets sweep out equal areas in equal times.
T2
3 Law of periods:
= a constant
a3
a 3 GM
Explicit form of Keplers third law:
=
T 2 42
Eccentricity is a measure of the elongation of an ellipse.
A circle is an ellipse of zero eccentricity.
Periapsis is the position of closest approach to the
central mass and fastest orbital speed (perihelion for the
Sun, perigee for Earth).
Apoapsis is the position of furthest distance from the
central mass and slowest orbital speed (aphelion for
the Sun, apogee for Earth).
If the mass of a satellite is not negligible when compared
to that of the central body, then both masses orbit with
the same period around the systems centre of mass.
Fc =
space
GM
r
Two-body orbital mechanical energy
1
GmM :
ME = mv 2
2
r
Magnitude of orbital speed: v =
53
Review questions
Explaining
and exploring
the solar system
Physically speaking
For each type of orbit, fill in the missing information. One has been done already.
Name of orbit
Sign of two-body ME
Open or closed
Bound or unbound
Negative
Closed
<
Bound
Geostationary
Slingshot (in planets frame)
Elliptical
Hohmann (see Physics Focus)
Parabolic
Hyperbolic
Halleys Comet
Circular
Molniya
Reviewing
Solve problems and analyse
information to calculate the
centripetal force acting on a
satellite undergoing uniform
circular motion about the
Earth using:
2
mv
r
Solve problems and analyse
information using:
mm
F =G 1 2
d2
Analyse the forces involved
in uniform circular motion
for a range of objects,
including satellites orbiting
the Earth.
F =
1 What was the first solid rocket propellant and who invented it?
2 Assuming that propellant is burned at a constant mass per unit time, use the
equation for thrust to explain why forcing exhaust gas through a narrow nozzle
increases thrust.
6 At the bottom of a bungee jump with the cord attached to the ankles, one can
easily experience a g-force of 3, the maximum normally allowed for Shuttle
launches. Describe three important differences in the way the g-force is
experienced in these two situations.
7 Explain why launch facilities are usually built as close to the equator as
is practical.
8 Describe the circumstances under which a star would not sit at the focus of
a planets elliptical orbit.
9 Discuss why we only briefly see Halleys Comet with an unaided eye every 76 years,
even though it is in orbit continuously around the Sun.
10 Outline what happens to the period of a satellite if its semimajor axis is reduced
by a factor of 4.
12 By re-examining the gravity assist worked example on page 45, show that the
magnitude of change in the probes velocity in the Suns reference frame is twice
the probes initial speed in the planets reference frame (that is, |VoutVin|=2v).
14 List two reasons why (human-crewed) space stations are always in low Earth orbit.
54
space
Solving Problems
15 Calculate the two-body gravitational potential energy for a system consisting
of a 1.00kg test mass sitting on the surface of the Earth. How far would
the test mass need to be from the Sun so that the two-body GPE of the
test massSun system is the same value? Estimate roughly where that
position would be in relation to the orbital radii of the planets.
5.00m radius. You have a g-force meter with you and at the moment
youre at the top it reads a vertical g-force of 0.00.
a What is your weight at that moment?
b What is the magnitude of the normal force exerted on you by the seat
at that moment?
c What is your centripetal acceleration?
d Calculate your speed at the top.
e Assuming friction and air resistance are negligible, calculate your
horizontal g-force at that moment.
Prunella then says: Yeah, but as long as the orbital speed v is high
enough, will be very close to 90 so you can use the approximation that
string tension T is the centripetal force and the string length L represents
orbital radius R.
v2
Show that as long as orbital speed v fulfils the condition
>7g,
R
then L is no more than 1% larger than the true orbital
radius R and T is no more than 1% larger than the true
centripetal force Fc.
mv 2
r
Analyse the forces involved in
uniform circular motion for a
range of objects, including
satellites orbiting the Earth.
F =
T sin
mg
55
Explaining
and exploring
the solar system
20 Using the mass of Jupiter calculated in the worked example on page 39,
21
predict the period of Jupiters moon Callisto, given that its semimajor axis
is 1.89106km.
2r
and the explicit form
Using the equation for tangential velocity v =
T
of Keplers law of periods, re-derive the expression from section 2.2, for
the magnitude of orbital velocity of a satellite in circular orbit:
v=
GM
r
22 Using the explicit form of Keplers third law, calculate the radii of:
a a geostationary orbit
b a circular semi-synchronous orbit.
23 Draw a table in which you compare the kinetic energy, the two-body
25 Gravity assist can also be used as a brake. Show that if the diagram in the
planets frame in Figure2.4.1a is unchanged, but the planets orbital
velocity in Figure2.4.1b is reversed, then the probes speed in the Suns
frame would decrease.
Re
56
iew
Q uesti o
space
PHYSICS FOCUS
final orbit
elliptical
transfer orbit
Figure 2.6.3
A Hohmann elliptical
transfer orbit
Extension
7 A Space Shuttle releases a communications satellite
into an initial orbit of radius r=6.66106m.
Using a Hohmann transfer orbit, the satellite
moves up to a geosynchronous orbit of
rgeo=4.216107m. Using the result of
Question 3, calculate how long the transfer takes.
(Hint: The transfer path is half an orbit.)
57
3
special relativity, general relativity,
inertial frame of reference, invariant,
principle of relativity, fictitious forces,
Maxwells equations, electromagnetic
wave, mechanical medium,
luminiferous aether, interferometry,
beamsplitter, aether drag, Michelson
Morley experiment, null result,
postulate, simultaneity, Lorentz factor,
time dilation, proper time, twin paradox,
proper length, length contraction, rest
mass, proper mass, relativistic mass,
spacetime, mass-energy,
Seeing in a weird
light: relativity
Just some minor problems
You may have heard it said that some physicists think that a theory of
everything is just around the corner. This attitude is not new. Many
physicists thought this about what is now called classical Newtonian
physics, towards the end of the late 19th century. There were just a few
minor problems with understanding the way light travels through space
that needed to be fixed, and then the job of physics would be finished.
Well, those minor problems with light led to the twin pillars of
modern physics: quantum mechanics and relativity. Einstein was a key
player in both, especially relativity, which comes in two parts. Special
relativity replaced Newtons mechanics and the later general relativity
replaced Newtons universal gravitation. A century later, relativity still
defies common sense, bending space, time and the mind, but it has
not yet failed any experimental test. In this chapter we will only deal
with the theory of special relativity.
Common sense is nothing more than a deposit
of prejudices laid down by the mind before
you reach eighteen.
A Einstein
58
space
nte
ractiv
M o d u le
Try this!
Fictitious fun
While sitting on a playground
merry-go-round with a friend,
try playing catch with a slow
moving tennis ball. The fictitious
centrifugal and Coriolis forces
will cause the ball to appear to
follow warped trajectories,
making it difficult to catch.
Worked example
mg
question
mg tan 5
A Christmas decoration is hanging obliquely inside your car, 5 from vertical and pointing
towards the cars left side. Describe quantitatively the cars motion (no skidding!).
Solution
Only two true external forces act on the decoration: tension and weight (Figure3.1.2).
Because there is an angle between them, they arent equal and opposite, so the decoration
experiences a net real force and acceleration sideways (in this case centripetal). The net
force and acceleration point towards the right side of the car, so the accelerometer
(and the car) is steering towards the right.
left
right
mg
Seeing in a
weird light:
relativity
The centrifugal force perceived by the occupants of the car to be pulling the decoration
toward the left side of the car is fictitious.
From Figure3.1.2, the magnitude of the centripetal acceleration is:
F
ac = c = g tan5=9.800.0875=0.86ms2
m
PRACTICAL
EXPERIENCES
Activity 3.1
rotating frames of
reference and is
associated with the
formation of cyclones.
Checkpoint 3.1
1
2
3
4
5
6
7
60
space
61
Seeing in a
weird light:
relativity
waves seem to obey the Galilean transformation. It was assumed that light should
also obey it, so the speed of light should be affected by the motion of the aether.
However, Maxwells equations appeared to allow only one particular value for
The Galilean transformation and Newtons
the speed of light in a vacuum.
laws imply it is impossible for the speed of light to appear to be the same to all
observers with different relative speeds. Perhaps the speed specified by Maxwells
equations is the speed relative to the aether only. However, this meant that the
aether represented a preferred reference frame for Maxwells equations, which was
inconsistent with the classical principle of relativity.
M and M
Describe and evaluate the
MichelsonMorley attempt to
measure the relative velocity of
the Earth through the aether.
Discuss the role of the
MichelsonMorley experiments
in making determinations about
competing theories.
PRACTICAL
EXPERIENCES
Activity 3.2
62
Given that the Earth was supposed to be hurtling around the Sun, through the
aether at 3104ms1, the resulting aether wind (or aether drift) relative to
Earth should affect measurements of light speed differently according to the time
of day and time of year as the Earth rotated and orbited the Sun, changing its
orientation relative to the aether.
So in the 1880s, the experimentalist Albert Michelson (18521931), joined
later by Edward Morley (18381923), attempted to measure changes in the speed
of light throughout the day due to this shifting aether wind. They used
a very sensitive method called interferometry (see section21.5), which
Michelson had used some years earlier to accurately measure the speed of light.
Recall constructive and destructive interference (see in2 Physics @ Preliminary
p102 and p126). If two light beams are projected onto a screen, then a bright
fringe occurs at places where the two beams are in phase (constructive
interference). Where they are out of phase, destructive interference results in a
dark fringe. The pattern of bright and dark fringes is called an interference
pattern (Figure 3.2.2).
Interference turns a pair of monochromatic (single wavelength) light
beams into an extremely sensitive ruler for which the interference fringes are like
magnified ruler markings one light wavelength apart. For visible light, this
spacing is less than 8107m and corresponds to time intervals of less than
31015s. If the two light beams travel via different paths, then a very small
change in the length of one path will change the relative phase, resulting in
a detectable change in the position of fringes in the interference pattern.
A change in wave speed along one of those paths should have a similar effect
on phase.
Michelson and Morley set up an interferometer in which the light was divided
into two perpendicular beams or arms by passing it through a half-silvered
mirror or beamsplitter (Figure3.2.3). The apparatus was built on a heavy stone
optical bench floating in mercury, to allow rotation and damp out vibrations.
They assumed that if one interferometer arm was pointing parallel to the aether
wind, the speed of light should be slightly different in the two arms. The time of
flight of the light in the arm parallel to the aether wind should be slightly longer
than that of light along the perpendicular arm. As the Earth (or the apparatus)
rotates, this speed difference, as measured by the positions of the interference
fringes (Figure 3.2.2), should change with the angle.
Figure 3.2.4 summarises the classically predicted effect of aether wind on the
resultant light speed in the two arms of the interferometer. Lets calculate the
expected time difference. Suppose the total distance from beamsplitter MS to M1
(or M2) is L, then the round-trip for each arm is 2L.
space
a
M1
source
Ms
aether
wind
l1
l2
M2
M2
Ms
eyepiece
source
2L
1
1
v2
c2
In the arm parallel to the aether wind (speed v), for half the trip against the
wind, the speed of light would be cv, and for the other half with the wind it
would be c+v, so the time taken would be longer:
L
L
2L
t2 =
+
=
c v c +v
c
1
1
v2
c2
(Check that you agree that since v is smaller than c, time in the parallel arm t2
is longer than time in perpendicular arm t1.)
Other factors such as thermal expansion or contraction of the apparatus
could cause apparent drift in the interference pattern, but the shift due to
rotation of the apparatus (or the Earth beneath it) would be a sine wave with
a period equal to the rotation period of the apparatus, so any drift not due to
rotation could be detected and subtracted. Michelson and Morley graphed the
position of interference fringes versus rotation angle at different times of the
day, but concluded that the small observed shifts could be explained as drift in
the experimental apparatus. Over several years, scientists repeated the
measurements, with some reports of possible changes in interference over the
day; but eventually the consensus was that any observed effect was well below
what was expected by the aether theory and could be explained by drift in
the apparatus.
George Fitzgerald and Hendrik Lorentz attempted to squeeze the Galilean
transformation into Maxwells equations, concluding that charged particles
(such as charges in atoms) moving through the aether with speed of v must
C 2 V 2
V
C
V
aether wind
t1 =
aether wind
In the arm perpendicular to the aether wind (speed v), if c is light speed
M1
C V
C+ V
Seeing in a
weird light:
relativity
Another reason suggested for failure to see the shift was that perhaps aether
only penetrates transparent objects, so aether was trapped by large opaque
mountains and valleys or buildings and dragged along by the moving Earth,
similar to the way in which air is trapped in the fur of a running dog. A flea
conducting scientific experiments on the dogs skin would be unaware that outside
the fur, air is whooshing backwards relative to the dog. This idea was called aether
drag. If this were true, then at the tops of mountains, closer to outer space, the
aether wind might be detectable. Some experimenters repeated the experiment on
mountains without success (apart from a controversial partial result).
Failure to detect undeniable effects of aether wind caused some physicists to
question if it even existed. Maxwells equations only mention electric and
magnetic fields. The aether is not required by the equations. Einstein assumed it
didnt exist, but said that relativity was not an attempt to explain Michelson and
Morleys negative result, but rather, he was motivated by the properties of
Maxwells equations. However, in physics, when experiment and theory say the
Today almost all physicists
same thing, youre probably on the right track.
agree that there is no aether.
The MichelsonMorley experiment is often called the most famous failed
experiment. It was not exactly a failure. In 1907, Michelson was awarded the
The result of an experiment that fails
Nobel Prize for Physics for his work.
to find evidence of an expected effect despite careful design and execution is more
correctly called a null result. This null result was one of the most important in
the history of physics, because it helped bring about a whole new way of seeing
the universe.
Checkpoint 3.2
1
2
3
4
5
6
7
8
64
space
Speed of light
Newton regarded space and time as absolute. In practical terms, this means that
the length of 1 metre, the duration of 1 second and the geometric properties of
shapes would be the same to all observers everywhere. Not all physicists agreed,
but the success of Newtons laws silenced any philosophical discussion.
However, Maxwells theory (and the MichelsonMorley experiment) pointed
to the speed of light in a vacuum being constant to all observers. So Einstein said
one of three things must be wrong: the principle of relativity (the invariance of
laws of mechanics in all inertial reference frames), Maxwells equations or the
Galilean transformation (the basis of all of classical mechanics).
The principle of relativity seemed very fundamental to Einstein, so he didnt
reject that. In fact, he extended Galileo and Newtons principle of relativity to
include all laws of physics, not just mechanics. He called it his first postulate.
Following a suggestion by Jules Henri Poincar (18541912), Einstein
decided that as the speed of light in a vacuum was invariant in all inertial frames,
then that must also be a law of nature, which he called the postulate of the
constancy of the speed of light.
Maxwells equations accurately described electromagnetic phenomena, so
Einstein didnt want to reject them. So it must be the Galilean transformation
(and hence all of classical mechanics) that was wrong. But it is difficult to see how
something so simple could possibly be wrong.
Suppose you are on a moving train, shining a torch towards the front of the
carriage. To your eyes, the light travels the length of the carriage L. To you, its
speed is the length of the carriage divided by the time t it took to get there c=L/t.
To an observer at the train station, the light travelled the length of the carriage
plus the distance D the carriage travelled in that time: c=(L + D)/t. The
arithmetic is so laughably simple. How could both observers possibly get the same
value for c? It could only be possible if you and the observer at the train station
In other words, if the
disagree on the lengths L or D or the time period t.
speed of light is constant then length (space) and/or time are not absolutethey
must depend on the state of motion of the observer.
So why had no-one noticed until 100 years ago? Classical mechanics had
successfully described phenomena for three centuries, but it had never been tested
Classical mechanics and the
for things moving at close to the speed of light.
Galilean transformation are accurate approximations at speeds well below the
speed of light. Only when the properties of light itself were examined, did the
problems become obvious.
the speed of light is the ultimate
Einstein showed (in several ways), that
speed limitno observer can reach the speed of light. As a teenager, he asked
What would the world look like if I rode on a light beam? He answered as an
adult with a thought experiment. A light beam is a wave of oscillating electric and
magnetic fields moving at the speed c. If you were in the same reference frame as
the light beam, you would observe stationary electric and magnetic fields that
vary as sine waves in space, but are constant in time. This is not an allowable
solution to Maxwells equations, so it is not possible for an observer to travel at
the speed of lightit is the ultimate speed limit.
Simultaneity
Einstein demonstrated that simultaneity is relative. Events apparently simultaneous
to one observer are not necessarily so to all observers. Lets use Einsteins own
Whats so
Special about
Relativity?
Seeing in a
weird light:
relativity
O2
O1
platform
c2
t0
1
mv =
v2
v2
c2
m0
1
v2
c2
Time dilation
The relativity of simultaneity suggests that time itself is a bit rubbery. Relativity
predicts that the rate of passage of time differs, depending on the velocity of
the observer.
Consider the following thought experiment (Figure3.3.2). Suppose you
(observer 0) are on a train, moving with speed v relative to the ground, while
measuring the speed of light by shining a light pulse vertically towards a mirror on
the train ceiling, a distance D from the light source. Outside on the ground is
observer v who appears to you to be rushing past with horizontal speed v and
watching your experiment. Both you and observerv regard your own reference
you both agree on three things: (1) the speed
frame as stationary. However,
of light, (2) your relative horizontal speed v and (3) the height of the mirror D.
You both agree on the height of the mirror because you are both in the same
reference frame with respect to vertical components.
Your light source also contains a detector capable of timing the interval t0
between the emission and detection of the light pulse. You do the experiment
and use the speed formula c = 2D/t0 to obtain the correct speed of light.
a
v
mirror
mirror
D
source and
detector
vtv
source
detector
Figure3.3.2 Measuring the speed of light on a train as seen by (a) observer 0 within the train and
(b) observer v from the reference frame on the ground
66
space
Observerv disagrees about what happened. She was carrying an accurate
stopwatch and timed the event independently, getting a longer time interval of tv.
Using Pythagoras theorem, she calculates that the path length of the light
(Figure3.3.2b) was not 2D, but rather:
1
2
2 D 2 + ( vt v )2 = 4 D 2 + (vt v )2
c=
tv
Square and rearrange: c2tv2 = 4D2 + (vtv)2 but observer 0 says c = 2D/t0
Eliminate D:
Rearrange:
t0
1
v2
c2
7
6
5
tv 4
t0 3
2
1
0
0.2
0.4
0.6
Speed VC
0.8
1.0
Seeing in a
weird light:
relativity
Worked example
Muon and on
and on
question
A muon is like a heavy electron, and at low speed it decays with a mean lifetime of
2.2106s. Suppose a beam of muons is accelerated to 80% of the speed of light.
What would their mean lifetime be in the laboratory reference frame?
Solution
Lifetime in the muons frame: t0=2.2106s
Speed of muons frame: v=0.80c
t0
Lifetime in the laboratory frame is tv: t v =
1
2.2 106
2
1 0.80
= 3.7 106 s
c
The twin paradox
When two people pass by quickly, observing each other, they both think the
others clock is running slower. The principle of relativity says you are both right.
The twin paradox is a thought experiment in special relativity. Bill goes for an
intergalactic cruise travelling at close to the speed of light (in Earths frame),
while Phil stays on Earth (Figure3.3.4). During the flight, they both correctly
conclude that the other twins frame is moving, and so he is ageing more slowly.
But what happens when Bill comes back home? Observations in the same
frame should agree. It turns out that Bill is younger than Phil. Does this violate
the principle that all inertial frames of reference are equivalent? No. Bill turned
around (accelerated) to come home. The situation is no longer symmetrical.
Special relativity isnt enough to explain what Bill saw from his accelerating
frame (he needs general relativity). However we have no difficulty talking about
what (non-accelerating) Phil saw.
By turning around and coming back, Bill left his original inertial frame and
re-entered Phils frame, so he should agree with Phil. Phil remained in his inertial
frame all along, so his conclusions (that Bill was moving and so is younger) have
been consistent with special relativity throughout and, in his frame, correct.
If instead Phil had hopped into another craft and caught up with Bills inertial
frame, then Bills original conclusion would have been correct and Phil would
have been younger.
This prediction has been confirmed using highly precise, twin atomic clocks
and an aeroplane.
Checkpoint 3.3
1
2
3
4
5
6
7
8
68
space
Length contraction
There is a grain of truth in Lorentz and Fitzgeralds suggestion (section3.2)
tv =
v2
c2
m0
1
v2
c2
v2
c2
c2
t0
1
mv =
v2
1.2
1
0.8
lv 0.6
l0
0.4
Imagine that observer1 and observer2 are trying to measure the length of
0.2
a rod, but all they have is a stopwatch. They already know accurately (and agree
on) their relative speed v. Observer1 is holding the rod and observer2 is holding
0
0
0.2
0.4 0.6 0.8
1.0
the stopwatch. They whoosh past each other almost touching, both looking at
Speed VC
the watch.
Figure3.4.1 Plot of the ratio of length in a
Observer2 is stationary relative to the watch (Figure3.4.2a), so he knows the
moving reference frame to
reading on his watch is his proper time. As the rod passes by, the watch reads
proper length (lv/l0) versus
zero at the start of the rod and t2 at the end, so the rod took a time t2 to pass by.
speed in units of c. Note that as
Therefore he calculates that the length of the rod in his frame is lv=vt2.
speed approaches c, lv shrinks
to zeroanother reason why the
Observer1 is stationary relative to the rod (Figure3.4.2b), so she knows that
speed of light is unattainable.
its length for her is the proper length l0. She agrees that the watch says t2, but the
moving watch seemed to be ticking too slowly, so the number on the watch must
be too small. Using the time dilation formula,
she calculates that the time t1 in her frame
a
b
was longer:
t2
v
t1 =
v
2
v
1 2
c
Figure3.4.2 Measuring the length of a rod using a stopwatch as seen by (a) observer 2,
60
55
60
60
10
50
15
45
20
40
35
30
25
55
10
50
15
45
20
40
35
30
25
Seeing in a
weird light:
relativity
b
v
C'
A'
B'
c
question
v
A'
B'
c2
Worked example
D'
C'
v2 .
l v = l0 1
The distance travelled by light in one year, 9.461015m, is called a light-year (ly).
The nearest star to our Sun is Proxima Centauri, 4.2light-years away.
Suppose you are travelling to Proxima Centauri at three-quarters of the speed of light.
a Calculate how long it takes to get there from Earth (measured using your
on-board clock).
b Discuss whether this answer is a contradiction.
Solution
a Both you and Earth-bound observers agree on your relative speed 0.75c. In the
spaceships frame, the distance to Proxima Centauri is contracted:
l v = l 0 1
t=
v2
c
l v 2.78 ly
=
= 3.7 years
c 0.75c
3.7 years is less than the 4.2 years that light takes to get there in Earths frame.
b
This is not a contradiction because in the spaceships frame, light would only
take 2.78 years because lv = 2.78 ly.
a
Earth
Proxima
Centauri
b
v
v
Earth
Proxima
Centauri
70
Note that in the last example, the astronauts thought they experienced a
short trip because the distance travelled was contracted, whereas the earthlings
thought the astronauts felt their trip was short because their time had slowed.
space
Relativistic mass
If you measure the mass m0 of an object at rest in your frame (rest mass or
proper mass) and use the classical definition of momentum p=m0v, then in
collisions, momentum is not necessarily conserved for all reference frames.
However, momentum is conserved if one instead uses p=mvv where mv
is the relativistic mass:
m0
mv =
v2
1 2
c
The relativistic mass of an object increases as its speed relative to the observer
increases. As speed approaches c, the mass approaches infinity, so the force
required to accelerate an object to the speed of light becomes infinite. This is yet
another reason why the speed of light cannot be reached.
When accelerating particles in accelerators, this increase in mass needs to be
taken into account, otherwise the machines wont work.
7
6
v (A rel. to B) =
vA v B
v v
1 A 2B
c
5
mv 4
m0 3
2
1
0
Relativistic
train crash
0.2
0.4
0.6
Speed CV
0.8
Worked example
question
A medical linear accelerator (linac) accelerates a beam of electrons to high kinetic energies.
These electrons then bombard a tungsten target, producing an intense X-ray beam that can
be used to irradiate cancerous tumours. A typical speed for electrons in the beam is 0.997252
times the speed of light.
Calculate the Lorentz factor and hence the relativistic mass of these electrons, given the
rest mass is 9.111031kg.
Solution
Lorentz factor =
1
1
mv =
m0
1
v2
1
1 0.997252
= 13.5
c2
= 9.11103113.5 = 1.231029kg
c2
Note: When calculating Lorentz factors close to the speed of light, use a greater number of
significant figures than usual, because you are subtracting two numbers of very similar size.
71
Seeing in a
weird light:
relativity
Solve problems and analyse
information using:
E = mc2
l v = l0 1
tv =
c2
t0
1
mv =
v2
v2
c2
m0
1
v2
c2
PHYSICS FEATURE
Twisting spacetime ...
and your mind
1. The history of physics
72
space
How does this formula behave at low speeds (when v2/c2 is small)?
mv c 2 =
m0c 2
1
v2
v2
= m0c 2 1 2
c
1
2
c2
Using a well-known approximation formula that you might learn at university,
(1 x )n1 nx for small x:
v2
m0c 2 1 2
c
1
2
1 v2
1
m0c 2 1 + 2 = m0c2 + m0v2
2
2 c
1
m v2
2 0
In other words, at low speeds, the gain in relativistic mass (mv m0)
multiplied by c2 equals the kinetic energya tantalising hint that at low speed
mass and energy are equivalent. It can also be shown to be true at all speeds,
using more sophisticated mathematics. In general, mass and energy are
equivalent in relativity and c2 is the conversion factor between the energy unit
(joules) and the mass unit (kg). In other words:
Rearrange:
E = mc2
where m is any kind of mass. In relativity, mass and energy are regarded as the
same thing, apart from the change of units. Sometimes the term mass-energy is
used for both. m0c2 is called the rest energy, so even a stationary object contains
energy due to its rest mass. Relativistic kinetic energy therefore:
m0c 2
mv c 2 m0c 2 =
m0c 2
2
v
1 2
c
Whenever energy increases, so does mass. Any release of energy is
accompanied by a decrease in mass. A book sitting on the top shelf has a slightly
higher mass than one on the bottom shelf because of the difference in
gravitational potential energy. An objects mass increases slightly when it is hot
because the kinetic energy of the vibrating atoms is higher.
Because c2 is such a large number, a very tiny mass is equivalent to a large
amount of energy. In the early days of nuclear physics, E=mc2 revealed the
enormous energy locked up inside an atoms nucleus by the strong nuclear force
that holds the protons and neutrons together. It was this that alerted nuclear
physicists just before World War II to the possibility of a nuclear bomb. The
energy released by the nuclear bomb dropped on Hiroshima at the end of that
war (smallish by modern standards) resulted from a reduction in relativistic mass
of about 0.7g (slightly less than the mass of a standard wire paperclip).
Evil twins
Worked example
question
When free protons and neutrons become bound together to form a nucleus, the reduction in
nuclear potential energy (binding energy) is released, normally in the form of gamma rays.
Relativity says this loss in energy is reflected in a decrease in mass of the resulting atom.
73
Seeing in a
weird light:
relativity
Exploding
a myth
Calculate how much energy is released when free protons, neutrons and electrons
combine to form 4.00g of helium-4 atoms (2 protons + 2neutrons + 2 electrons). At room
temperature and pressure, each 4g of helium gas is about 25L, roughly the volume of an
inflatable beach ball.
Data:
c2 = 8.98761016m2 s2
Solution
Total mass of the parts:
= 6.696921027kg
Reduction in mass:
= 5.04441029kg
E = mc2 = 5.04441029kg8.98761016m2 s2
= 4.53371012J
Checkpoint 3.4
1
2
3
4
5
6
7
8
74
Discuss why, if Lorentz and Fitzgerald came up with the correct formula for length contraction, Einstein gets the
credit for explaining relativistic length contraction.
Write the formula for length contraction. Would a ruler moving lengthwise relative to you appear shorter or longer?
Define the term proper length.
To what limit does observed length of a moving object tend as speed approaches c?
Write the formula for relativistic mass. Would a mass moving relative to you appear larger or smaller?
Use relativistic mass to justify the statement that the speed of light is the universal speed limit.
Define all the terms in the equation E = mc2 and explain what the equation means.
Explain why an atom weighs less than the sum of its parts.
PRACTICAL EXPERIENCES
space
CHAPTER 3
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
60
20
10
20 30 40
0 10
50
90 8
0 70 60 50
40
30
Discussion questions
1 The principle method for detecting a non-inertial frame is measurement
of acceleration. Describe an example of a non-inertial frame in which
a typical accelerometer would not appear to measure an acceleration or
detect extra fictitious forces.
2 Is there a test that can be performed within a frame of reference to tell
if the effect measured by the accelerometer is the result of acceleration
of the frame or due to an actual additional force?
80
Perform an investigation to
help distinguish between
non-inertial and inertial frames
of reference.
70
Discussion questions
1 Describe what Michelson and Morley were expecting to observe if aether
were present.
2 Using the data you have gathered, explain how your observations support
or refute the existence of the aether.
3 Recall the interpretation put forward by Michelson and Morley.
4 Discuss the importance of this experiment.
Extension
1 Research the history of how long the belief in aether persisted in some
physicists after the publication of special relativity in 1905.
2 Read the following paper, which contains a thorough review of the history
of the MichelsonMorley experiment, including historical letters to and
from several researchers:
Shankland, RS, 1964, MichelsonMorley Experiment, American Journal
of Physics, vol.32, p 16.
75
Chapter summary
Seeing in a
weird light:
relativity
Review questions
Physically speaking
Use the words below to complete the
following paragraph:
the _______________ of the speed of light does not _______________ between frames,
so light doesnt obey the transformation formula of _______________. Because of
this, measurements of _______________, _______________ and _______________ within
a reference frame moving relative to the _______________, will depend on the
_______________ of that frame.
76
space
Reviewing
1 You have a priceless Elvis Presley doll hanging from
your rear-vision mirror at a constant angle from
vertical. Elviss feet lean towards the front of the car.
Are you driving:
A forwards at uniform speed?
B backwards at uniform speed?
C forwards but accelerating?
D forwards but decelerating?
mv =
c2
t0
1
v2
v2
c2
m0
1
v2
c2
77
Seeing in a
weird light:
relativity
Solving Problems
13 Depending on your answer to Question 1, calculate
the magnitude of your speed or acceleration if the
Elvis Presley doll hangs at a constant angle of 10
from vertical.
L = L 1 v 2 c2
then the difference t2t1 between the times of flight
for the two arms would be zero. Use the equations
given for t2 and t1 in section3.2.
Re
78
iew
Q uesti o
c2
v2
c2
v2
m0
mv =
c2
t0
tv =
v2
2 2
l v = l0 1 v c
tv =
t0
2 2
1v c
mv =
m0
2 2
1 v c
v (A rel. to B) =
vA vB
1 vA vB / c 2
space
PHYSICS FOCUS
Cant measure the speed
of light
79
Multiple choice
(1 mark each)
1 Ignoring air resistance, all projectiles fired
horizontally from the same height above horizontal
ground will have the same:
A horizontal velocity.
B time of flight.
C range.
D final speed.
You have just rounded the top of a curve on a rollercoaster. The g-force meter you are carrying reads
exactly zero. Which one of the following is true?
A Your weight is the centripetal force.
B Your weight is zero.
C Your weight is equal and opposite to the normal
force exerted on you by the seat.
D Your weight is equal and opposite to the tension
in your body.
80
Short response
6 The escape velocity from the Earths surface, based
space
10 A 9000 kg helicopter is parked at the equator and
then later near the North Pole. Assuming the two
locations are chosen so that the gravitational potential
energy is the same at the two spots, estimate the
difference in relativistic mass at the two locations.
At which location will the mass be larger? (Hint:
Velocity is low, so use the classical expression for
kinetic energy.) (3 marks)
Extended response
11 Critically discuss the following proposition: The
MichelsonMorley experiment was an embarrassment
for physics because, despite a large effort, it failed to
find what it was looking for and so it should be
relegated to the dustbin of physics history. (5 marks)
81
2
Context
82
Motors and
Generators
The first recorded observations of the relationship between electricity and
magnetism date back more than 400 years. Many unimagined discoveries
followed, but progress never waits. Before we understood their nature, inventions
utilising electricity and magnetism had changed our world forever.
Today our lives revolve around these forms of energy. The lights you use to
read this book rely on them and the CD inside it would be nothing but a shiny
coaster for your cup. We use magnetism to generate the electricity that drives
industry, discovery and invention. Electricity and magnetism are a foundation for
modern technology, deeply seated in the global economy, and our use impacts
heavily on the environment.
The greatest challenge that faces future generations is the supply of energy.
As fossil fuels dry up, electricity and magnetism will become even more
important. New and improved technologies will be needed. Whether its a hybrid
car, a wind turbine or a nuclear fusion power plant, they all rely on applications
of electricity and magnetism.
INQUIRY ACTIVITY
Build your own electric motor
Many of the devices you use every day have electric motors. They spin your DVDs,
wash your clothes and even help cook your food. Could you live without them,
and how much do you know about how they work?
The essential ingredients for a motor are a power source, a magnetic field
and things to connect these together in the right way. Its not as hard as you
think. All you need is a battery, a wood screw, a piece of wire and a cylindrical or
spherical magnet. Put these things together as shown in Figure 4.0.2 and see
if you can get your motor to spin. Be patient and keep trying. Then try the
following activities.
1 Test the effects of changing the voltage you use. You could add another
battery in series or try a battery with a higher voltage.
2 Try changing the strength of the magnet by using a different magnet or
adding another. What does this affect?
3 Try changing the length of the screw, how sharp its point is or the material
it is made from. Does it have to be made of iron?
83
Electrodynamics:
moving charges
and magnetic
fields
Strange but true
Electric current
In the presence of an electric field, charged particles and free ions will move
towards a region with an opposite electric charge. In this module we will be
considering the movement of electrons within metal conductors such as copper.
If you connect a wire to the terminals of a battery, as in Figure 4.1.1, an electric
field is set up along the length of the wire between the two ends. The free
electrons in the metal wire are then attracted towards the positive terminal. The
movement of these electrons is called an electric current. Protons are bound
tightly inside the nuclei of atoms and the nuclei are essentially fixed in position.
This means that when a wire is connected to a battery the only charges free to
move inside the wire are the free electrons.
84
motors and
generators
An electric current is defined as the rate of flow of net charge through
a region. In the wire in Figure 4.1.1, a number of electrons will flow through the
area A each second, and each electron carries a charge of 1.6 1019 coulombs.
If the current is 1ampere (or 1amp), it carries 1 coulomb of charge per second
through area A, which is more than 6billion billion electrons each second.
We now know that the nature of an electric current is actually a flow of electrons.
However, in electric circuits we often consider currents as if they were a flow of
positive particles. This type of current is called conventional current and it is in the
opposite direction to the flow of the negatively charged electrons. This confusing
situation has arisen because current was first thought to be a flow of positive particles.
Conventional methods for determining the direction of other physical quantities,
such as magnetic fields and forces, have been developed using the conventional
direction of current. So we will stick with these historical conventions.
It is important throughout this module to consider the direction of an
electric current as the direction in which a positive charge would flow through a
conductor. In Figure 4.1.1, conventional current flows from the positive terminal
towards the negative terminal, as indicated by the arrow along the wire.
An electrical current in which the charges only flow in one direction is
called direct current (DC). This current is commonly used in small portable
electronic devices and is supplied by a battery. One way of illustrating this type
of current graphically is shown in Figure 4.1.2.
The red line on the graph is a direct current measured by a digital ammeter.
The sign of the current (+ or ) represents the direction in which the current is
travelling. You can see that in this example the current has a constant value and
direction over time.
In contrast to direct current, an alternating current (AC) is continually
changing direction. The sign of the terminal at each end of an AC circuit
alternates between positive and negative over time. Each time this occurs, the
electric field within the wire changes direction. This reverses the direction of the
force on the charges within the wire and the current changes direction accordingly.
This type of current is good for transporting electrical energy over large distances
and is commonly used in larger appliances. In Figure 4.1.2, an AC current
measured by a digital ammeter is shown as a blue line. As the current changes
direction the blue line moves above or below the horizontal
axis. The corresponding change in sign of the current
indicates a change in the currents direction.
area
A
DC
+
AC
Current (A)
copper wire
10 20
30
40
50
Time
(ms)
Flick of a switch
Electrodynamics:
moving charges
and magnetic fields
Birds on a wire
energy transferred
time taken for transfer
Recall that:
P = IV
V2
Substituting Ohms law V = IR: P = I2R or P =
R
where P is power in watts (W).
Watts are equivalent to joules per second (Js1), so we can determine the
energy lost:
Energy = Pt
where energy is in joules (J) and time is in seconds (s).
86
motors and
generators
magnetic field B
Electrodynamics:
moving charges
and magnetic fields
conventional current I
(a)
direction of the magnetic field around the wire in Figure 4.1.8. You should see
that the magnetic field lines would go into the page on the right-hand side of the
wire (represented by ) and come out of the page on the left (represented by ).
An extension of the situations you have reviewed above is the magnetic field
around a current-carrying wire loop. Again we use the right-hand grip rule to
determine the direction of the magnetic field around the current-carrying wire
(Figure 4.1.9). Notice that the magnetic field in the centre of the loop always
points in the same direction, no matter where your hand is around the loop.
In the following chapters there are many situations that involve loops of wire
carrying currents; therefore, it is important you are familiar with the magnetic
field that is produced around them.
Most applications of magnetic fields in current-carrying wire loops actually
involve more than one loop. Each loop is called a turn, and many turns together
are known as a solenoid (Figure 4.1.10). A solenoid is simply a long coil of wire,
and the magnetic field produced is similar to that of a bar magnet, with a north
The direction of the magnetic field through
and south pole at each end.
the centre of the solenoid is determined by using a special version of the righthand grip rule (Figure 4.1.10). In this situation, you must curl your fingers in
the direction of the conventional current around the solenoid and your thumb
will point in the direction of the magnetic field. Your thumb will point to the
end of the solenoid that forms a north pole. A coil such as this is used to make
an electromagnet or simply to produce a magnetic field.
(b)
Geographic
North Pole
Magnetic
South Pole
Magnetic
North Pole
Geographic
South Pole
Figure 4.1.11 The Earth acts as though it has a south magnetic pole near the geographic
north pole! The magnetic north pole is the place to which the north end of
a compass appears to point.
88
motors and
generators
Checkpoint 4.1
v
B
S
B
direction of the
magnetic field
direction of the force F
on the positive particle
89
Electrodynamics:
moving charges
and magnetic fields
Try this!
Bending beams of particles
Ask your teacher if they can show you a beam of electrons
in a Crookes magnetic deflection tube (see Figure 4.2.2).
With teacher supervision, bring the north pole of a magnet
close to the front of the tube and observe the effect on the
beam of electrons. By the right-hand palm rule, negative
particles experience a force out of the back of your hand.
Try to predict the direction in which electrons will be
deflected when you place the south pole of the magnet
close to the front of the tube. Now try it and then explain
it to a friend. Do they agree with your explanation?
Checkpoint 4.2
1 Explain why a stationary charged particle experiences a force when you move a magnet past it.
2 Identify the direction in which the proton in Figure 4.2.1a would be moving for it to experience a force into the page.
PRACTICAL
EXPERIENCES
Activity 4.1
90
motors and
generators
When we allow current to flow through a wire
within the magnetic field in Figure 4.3.1, we see
that the wire moves out of the page, at right angles
to both the magnetic field and the direction of the
conventional current. Each positive particle of the
conventional current within the wire in Figure 4.3.1
experiences a force due to its motion within the
external magnetic field. Since these positive particles
are within the wire, the force acts on the wire.
I
S
B
direction of the
magnetic field
F
PHYSICS FEATURE
Loudspeakers
B
C
direction of force
on the voice coil
A cardboard cone
91
Electrodynamics:
moving charges
and magnetic fields
F = BIlsin
where F is magnitude of the force on the wire in newtons (N), l represents the
length of the wire inside the magnetic field in metres (m), I is the size of the
current in amps (A), B is the strength of the magnetic field in tesla (T) and is
the acute angle between the magnetic field and the wire.
Note that you can rearrange this equation and make any of the variables the
subject to find their values.
Worked example
Question
If the wire in Figure 4.3.3 has a length of 5 cm within a 0.2 T magnetic field, the wire is at
30 to the field and it contains a current of 0.5 A, what is the force exerted on the wire?
I
B
l
SOLUTION
First we convert the length of the wire into the appropriate units: 5cm = 5/100 = 0.05 m
sin =
Figure 4.3.3
a
l
a = l sin
Then substitute:
F = BIlsin
92
If you have an equation that describes a relationship, the easiest way to see
how the variables affect the subject of the equation is to place some numbers in
the equation and see what happens.
Remembering that the motor effect is the force F that a current-carrying
conductor experiences in a magnetic field, lets look at how the other variables in
the equation affect the magnitude of F. As you read through this section keep
referring back to the equation and check that you come to the same conclusions.
motors and
generators
Current, I
As for magnetic field strength, by inspecting the formula we can see
that force F will be directly proportional to current I.
Length, l
Similarly, F is directly proportional to the length l of the currentcarrying conductor within the magnetic field. Be particularly careful to
remember that l is the total length of the wire within the magnetic field.
It is noteworthy that l is regarded as a vector, but current I is not.
Angle,
When the wire is parallel to the magnetic field, the angle is zero. Inspecting
Figure 4.3.4 you can see that if is zero degrees then sin is also zero. When
you substitute zero for sin in the motor effect equation, you see that the
This shows us the interesting situation that the force
force must be zero.
on a current-carrying conductor in a magnetic field is zero when the
conductor is parallel to the magnetic field lines.
When the current-carrying conductor is perpendicular to the magnetic
field, is 90. From Figure 4.3.4 you can see that sin90 is 1. Since 1 is the
maximum value for sin, when we substitute 1 into the equation the force F
will be the maximum value it can be for each set of the other variables.
This shows that the force F is a maximum value when a current-carrying
conductor is perpendicular to the magnetic field B.
Inspecting Figure 4.3.4 you can see that as increases from 0 to 90 the
value of sin increases towards a value of 1. The rate of this increase is not
So we can only say that
constant (i.e. the graph is not a straight line).
force F depends on , as it is not directly proportional to .
Nanotube
Loudspeakers:
No Magnets
sin
1
90
180
270
360
Checkpoint 4.3
1 Describe the relative directions of the force, the current and the
magnetic field when a current-carrying wire experiences the
maximum possible force due to the motor effect.
2 Compare the relationships of B, I, l and to F in the equation
F = BIlsin.
3 Explain the motor effect.
Electrodynamics:
moving charges
and magnetic fields
Qualitative analysis
Try This!
the motor effect
Take a piece of insulated wire
about 510 metres long.
Stretch it out between two
retort stands so that there are
two pieces of wire running
parallel within a few
centimetres of each other.
Connect the ends to a 12 V
battery and insert a tapping
key switch at one end of the
circuit. When connected
briefly, the currents will run
antiparallel to each other.
Caution: Connect these wires
for a very short time only, as
they carry a large current.
Predict what will happen when
you press the switch. Now
observe! How did you go?
Let us first consider the situation in which we have two parallel current-carrying
wires with currents that are travelling in the same direction.
In Figure 4.4.1a we use the right-hand grip rule to determine the direction of
the magnetic fields around the wires. Now, to understand what is happening to
each wire, we will consider what is happening to one wire at a time.
In Figure 4.4.1b we are looking at what is happening to wire 2. The magnetic
field generated by the current in wire 1 travels into the page around wire 2.
Using the right-hand palm rule, we can see that wire 2 experiences a force
towards wire 1.
In Figure 4.4.1c we now see what is happening to wire 1. The magnetic field
of wire 2 comes out of the page around wire 1. Therefore the right-hand palm
rule shows that wire 1 experiences a force towards wire 2.
The conclusion we can come to is that when two parallel currentcarrying conductors have currents travelling in the same direction, the two
conductors are forced towards each other.
II
F
=k 12
l
d
94
I2
F1
wire 1
wire 2
magnetic fields
around parallel wires
I1
wire 1
wire 2
magnetic field
due to I1
I1
I2
F2
wire 1
wire 2
magnetic field
due to I2
Now lets consider the two parallel current-carrying wires with currents that
are travelling in opposite directions.
In Figure 4.4.2a the right-hand grip rule shows us the direction of the
magnetic fields around the wires. To understand what is happening to each wire,
we will again consider each wire in turn.
Lets look at what is happening to wire 2 first (Figure 4.4.2b). The right-hand
grip rule shows the magnetic field of wire 1. This field travels into the page
around wire 2. The right-hand palm rule shows that wire 2 experiences a force
away from wire 1.
Figure 4.4.2c shows what is happening to wire 1. The magnetic field of wire 2
goes into the page around wire 1. The right-hand palm rule then shows that
wire 1 experiences a force away from wire 2.
The conclusion we can now come to is that when two parallel currentcarrying conductors have currents travelling in the opposite direction, the two
conductors are forced away from each other.
It may be easy for you to remember the two conclusions above about the
direction of forces on parallel wires, although remembering the result is generally
less important than knowing how you got there. If you forget the conclusions
motors and
generators
a
wire 1
I1
wire 2
wire 1
magnetic fields
around antiparallel wires
I2
wire 2
magnetic field
due to I1
I1
I2
wire 1
wire 2
magnetic field
due to I2
above and you know how to work them out yourself you can never get them
wrong. You will apply similar methods in other problems later in this module.
So if you are comfortable with these methods now, it will be easier later. If at any
time you have trouble using your right-hand rules, come back to the relevant part
of this chapter and revise. You will meet the skills you have used here several more
times yet and each occasion is a chance to test your knowledge.
I1
I l
d 2
II
F
=k 1 2
l
d
where F is the force on each wire in newtons and l is the length the wires are
parallel in metres, so F/l is the force on each metre of wire. k is the proportionality
constant 2 107 NA2, I1 and I2 are the currents in the two wires in amps, and
d is the distance between the two wires in metres.
95
Electrodynamics:
moving charges
and magnetic fields
1.5 A
1.0 A
Worked example
QUESTION
For the situation shown in Figure 4.4.3 calculate the magnitude of the force acting on
each wire.
0.5 m
SOLUTION
2 cm
Feeling
the pinch
he piece of copper
pipe shown in
Figure 4.4.4 was
crushed by lightning.
Just like two parallel
wires carrying currents
in the same direction,
the sides of this pipe
were pulled together
when a current of more
than 100000 amps
was present.
Figure 4.4.4
Dramatic evidence of
parallel conductors
experiencing a force
F=
= 7.5 106 N
Note that as this question asked for only the magnitude of the force, you do not have to
include a description of the direction. If asked, you should add that the force on each wire is
directed towards the other wire.
Checkpoint 4.4
1 Identify the two key facts that explain the interactions of two parallel current-carrying conductors.
2 Describe the interactions of two parallel current-carrying conductors.
96
PRACTICAL EXPERIENCES
motors and
generators
CHAPTER 4
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
F = BIlsin
Equipment: 2 strong horseshoe magnets or ceramic magnets on an iron yoke,
long wire, power supply, retort stand, clamp.
hanging from
a retort stand
Perform a first-hand
investigation to demonstrate
the motor effect.
Solve problems and analyse
information about the force on
current-carrying conductors in
magnetic fields using:
F = BIlsin
flexible wire
ceramic magnets
on an iron yoke
low voltage DC
power supply
Discussion questions
1 Describe the motor effect.
2 Discuss what happened when the current direction was changed.
97
Chapter summary
Electrodynamics:
moving charges
and magnetic fields
Review questions
Physically speaking
1
Across
4 Equation to determine the force between
two current-carrying wires (7, 3)
2
4
3
5
6
8
Down
1 Electrons moving in one direction (6, 7)
2 Unit of power
10
11
98
motors and
generators
Reviewing
1
2
+
b
Force (N)
0.05
0.08
0.11
0.17
0.1
0.2
0.3
0.4
solving Problems
13 Calculate the resistance in a circuit that has a battery
supplying 10V and current flow of 2.3A.
iew
Q uesti o
Re
99
5
electromagnetic induction,
Faradays law, magnetic flux,
magnetic field strength B, magnetic flux
density, emf, perpendicular area, Lenzs
law, law of conservation of energy, eddy
currents, induction cooktops, resistive
heating, eddy current braking
Induction:
the influence
of changing
magnetism
Electromagnetic induction
The discovery of electromagnetic induction was a giant step on the path
to modern technology. Our understanding of this phenomenon required
a great deal of new physics and involved the work of many individuals.
One man, Michael Faraday, led the way with his experimental genius
and intuitive diagrammatic reasoning.
Faraday lacked the mathematical skills to numerically describe his
discoveries, but James Clerk Maxwell took Faradays understanding and
eventually quantified all electromagnetism. Faradays law and Lenzs
law provide us with the tools to explain and predict
eddy currents and to understand their applications.
Later they will help us in our quest to uncover the
secrets of motors, generators and transformers.
100
motors and
generators
He noticed that this current was only induced while the magnet was moving.
He moved the magnet out of the coil, and this time he measured a current in
Following many
the opposite direction within the coil (Figure 5.1.1b).
other experiments, he put forward his general principle of electromagnetic
induction that a changing magnetic field can cause a current to be generated in
a wire. This change can be caused by either the relative motion of the field and
the coil or by a change in the strength of the magnetic field.
In light of Faradays conclusion, lets offer a simple way to understand his
observations. In Figure 5.1.1 we can consider the change in the magnetic field
through the coil to be represented by the number of magnetic field lines passing
within the coil. In Figure 5.1.1a, notice that the magnetic field around the bar
magnet is not uniform. You see that as you get closer to the poles of the magnet
the magnetic field lines get closer together, indicating that the field is stronger.
Therefore, as the magnet gets closer to the coil more field lines pass within the
coil, indicating that the magnetic field within the coil gets stronger. So, as the
magnet gets closer to the coil the strength of the field within the coil is changing
and this induces the current. This gives us a general explanation for
electromagnetic induction. Now lets take things a bit further.
Magnetic flux B
Magnetic flux is a measure of the amount of magnetic field passing
through a given area. There are two variables that determine the value of
magnetic flux: the strength of the magnetic field B and the area the field is
passing through A. Magnetic flux B is measured in weber (Wb) and can be
expressed by the equation:
B = BA
Magnetic field strength B, a quantity we are already familiar with,
is also called magnetic flux density. This quantity is measured in tesla (T), or
equivalently in webers per square metre (Wb m2). This is a measure of the field
strength per square metre.
A is an area that is perpendicular to the magnetic field lines. If a magnetic
field passes through a circular wire loop of area A (Figure 5.1.2a) and the loop is
at an angle to the field (see Figure 5.1.2b), then the field passes through an
effective area A that is perpendicular to the field and is smaller than area A (see
Figure 5.1.2c). Note that flux could also be calculated using the perpendicular
component of the magnetic field strength, B, and the total area A.
a
N
A
b
N
Our suns
magnetic
influence
PRACTICAL
EXPERIENCES
Activity 5.1
101
Induction:
the influence of
changing magnetism
This line is the perpendicular
height of area A.
B
A
Figure 5.1.2 (a) A circular coil of area A. (b) A side view of the coil at an angle to the magnetic
field. (c) The effective area of the coil perpendicular to the magnetic field viewed
from point P (b)
The product of the magnetic flux density B and the area A gives a
measure of the total amount of magnetic field passing through that area. This
is the magnetic flux B.
The delta () symbols in Faradays law mean a change in some quantity. So
the two terms with delta symbols attached are differences between an initial
value and a final value. The term B stands for the change in the magnetic
flux and is given by:
B = B final B initial
The t term represents a period of time. This is the period of time over
which the change in flux B is measured.
We can see that the B/t term in Faradays law is actually the rate
of change of magnetic flux, just as acceleration is the rate of change of velocity
(aav=v/t). The term B/t tells us how fast the flux is changing.
emf
The symbol stands for emf, measured in volts. It is the difference in electrical
potential between the two ends of a coil (X and Y in Figure 5.1.3).
This emf creates an electric field within the wire of the coil and a current is
established, provided the circuit is complete (X and Y are not connected in
Figure 5.1.3). A current will flow as long as there is a change in the magnetic
field within the coil.
Qualitative analysis of Faradays law
Lets now look at the relationship between emf and the other terms in the
equation. Recall that the equation for Faradays law is:
= n(B/t)
B
N
X Y
The emf is proportional to the rate of change of the magnetic flux (B/t).
If there is a large change in flux in a small amount of time, then B/t is
large; therefore, the emf produced will be large. This emf is responsible for the
induced current in a closed loop of wire. If the emf is large, then so is the
induced current. Another way of saying this is that if we wanted to induce a
large current, we would change the magnetic field within the coil as much as
possible in the shortest time possible. To do this in the example we have seen,
we could move the magnet more quickly, use a stronger magnet or make the
perpendicular area as large as possible for the coil.
The emf is also proportional to n, the number of turns in the coil. Again, if
we wanted to create a large current we would want to have as many turns in the
coil as possible.
motors and
generators
By inspecting all the variables in Faradays law we can conclude that
an induced potential difference, and therefore an induced current in a coil,
is proportional to the rate of change of magnetic flux (B/t) and the
number of turns (n) within the coil. Further, we can conclude that induced
currents are produced by changing the magnetic field strength B, the relative
motion between B and the area A, or changing the perpendicular area A of
the coil.
Our next challenge is to find the direction of an induced current, and for
that we need Lenzs law. We will cover this in section 5.2.
Try this!
skipping currents
soft
iron
ring
switch
coil B
coil A
IA
BA
IB
2
1
5
4
Induction:
the influence of
changing magnetism
Checkpoint 5.1
1 Outline Faradays discovery of induction by a moving magnet and summarise his conclusion.
2 Define magnetic flux in terms of magnetic flux density.
3 Using the term magnetic flux, explain why removing a magnet quickly from a coil induces a relatively large current.
direction of
movement
direction of
movement
S
N
N
As the north pole of the magnet gets closer to the wire loop in Figure 5.2.1a,
the magnetic flux passing through the coil increases. This change in flux induces
a current in the wire loop. Now lets find the direction of the current to agree
with Lenzs law.
The north pole of the magnet is coming towards the coil, so the magnetic
flux pointing downwards through the coil is increasing. Lenzs law says that the
magnetic field produced by the induced current should oppose this change in
flux, so the induced flux should point upwards through the coil. Using the
right-hand grip rule for a wire coil or solenoid, we see that the current would
need to flow in the direction shown in Figure 5.2.1b, to produce an upwardspointing magnetic field within the loop. This field is pointing in the opposite
direction to the changing magnetic flux, so it reduces the changing magnetic flux
from the approaching magnet. Simply, you can think of the interaction of these
two magnetic fields as if the north poles of two bar magnets are facing each
other. The two north poles repel each other, and in this way the induced field
acts to minimise the increasing magnetic flux within the coil.
104
motors and
generators
Now, lets look at what would happen when you move a magnet away from
the loop with its north pole facing the loop (Figure 5.2.2).
In this case the magnetic flux in the coil is decreasing and pointing
downwards. The induced magnetic field should therefore be also pointing
downwards to add to the reducing field and try to minimise the change. Using
the right-hand grip rule, we see that we need a current as shown to produce a
downwards-pointing induced magnetic field. Again, you can think of the
interaction of these two magnetic fields as the interaction of two magnets in
which a north pole is facing a south pole. The two ends of the magnets are
attracted to one another and in this way the induced field acts to add to the
decreasing flux within the coil. These examples leave us with Lenzs law as
a tool to determine the direction of an induced current in a wire coil due to
a changing magnetic flux.
direction of
movement
S
N
Checkpoint 5.2
1
2
3
105
Induction:
the influence of
changing magnetism
direction of swing
I1
F2 causing
braking effect
F1
F3
I2
+
F4 causing
braking effect
Figure 5.3.1 A square metal sheet is swung through a uniform magnetic field between two bar
magnets. A braking effect is observed due to induced currents and their magnetic
fields, in accordance with Lenzs law.
106
motors and
generators
a magnetic field out of the page (indicated by the N for the north pole of the
currents magnetic field). The flow of positive charges in the direction of F1 is a
current. This current experiences a force due to the uniform magnetic field. This
force F2 opposes the motion of the copper, acting as a braking effect.
As the copper square leaves the magnetic field (on the right in Figure 5.3.1)
Lenzs law tells us that the eddy current should produce a force to slow the squares
departure from the field. The positive charge shown experiences a force F3
upwards, as shown. This causes a flow of positive charges in the direction of F3.
This current experiences a force due to the uniform magnetic field. This force F4
opposes the motion of the metal sheet, again acting as a braking effect.
If you have access to the equipment that demonstrates the situation in Figure
5.3.1, then try observing it for yourself. You may also be able to observe the
effect of cutting slots through the piece of metal swinging in the field. The slots
limit the size of the eddy currents that can be produced and therefore the size of
the induced magnetic fields, and the braking effect is considerably less. We will
meet this idea of reducing the size of eddy currents again in chapters 6 and 7.
Try this!
Racing magnets
Find two identical magnets. Get a piece of copper or aluminium sheet
and a sheet of a non-metal, such as glass, with a surface similar in
smoothness to the surface of the metal. Place the two sheets at the
same angle (say 60) to the table surface and place the magnets at the
same height on each sheet (see Figure 5.3.2). Now predict which
magnet is going to win the race and why. Now race! Did everyone agree?
Explain your observations to a friend.
aluminium or
copper
glass
60
107
Induction:
the influence of
changing magnetism
PHYSICS FEATURE
PRACTICAL
EXPERIENCES
Activity 5.2
Induction cooking
Q = Pt = I2Rt
where Q is in joules, power P is in watts or joules
per second (Js1), I is in amps A, R is in ohms ()
and t is in seconds (s).
eddy currents
produced in
base of frypan
ceramic surface
coil supplied with
high frequency AC
AC
rapidly changing
magnetic field
B
Checkpoint 5.3
1
2
108
Explain the formation of eddy currents in a small, flat, square metal sheet that falls between the poles of a magnet.
Describe how Lenzs law can be used to predict the formation of eddy currents.
PRACTICAL EXPERIENCES
motors and
generators
CHAPTER 5
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Perform an investigation to
model the generation of an
electric current by moving a
magnet in a coil or a coil near
a magnet.
Plan, choose equipment or
resources for, and perform a
first-hand investigation to
predict and verify the effect on
a generated electric current
when:
the distance between the coil
and magnet is varied
the strength of the magnet
is varied
the relative motion between
the coil and the magnet is
varied.
Plan, choose equipment or
resources for, and perform a
first-hand investigation to
demonstrate the production
of an alternating current.
Gather, analyse and present
information to explain how
induction is used in cooktops
in electric ranges.
Gather secondary information
to identify how eddy currents
have been utilised in
electromagnetic braking.
109
Chapter summary
Induction:
the influence of
changing magnetism
Review questions
Physically speaking
The theme of this word search is induction.
There is a twist; there is no list provided, so
you have to work out the words that have been
included. Find the 10 hidden words that have to
do with induction, list them and write their
definitions.
110
W N
motors and
generators
Reviewing
1 Outline Faradays experiment that led to the discovery
of electromagnetic induction.
wire
bulb
rails
conservation of energy.
Solving problems
15 Justify the claim that induction cooktops need
special cookware.
iew
Q uesti o
Re
111
Induction:
the influence of
changing magnetism
PHYSICS FOCUS
Eddy Currents Stop Me
in My Tracks
N
S
S
N
N
S
S
N
N
S
S
N
coils
N
S
S
N
top of rail
112
top of
stationary
rail
coil wound
around soft
iron core
location
of coil
+
I
rail
direction
of trains
motion
motors and
generators
Extension
30
25
20
15
10
5
0
0
20
40
60
80
100
120
140
160
113
Motors: magnetic
fields make the
world go around
The magic of motors
stator
(curved magnets)
commutator contacts
coils
motors and
generators
Table 6.1.1 Parts of a DC motor
Part
Coils
Many loops of wire that carry a direct current. These wires experience
a force (due to the motor effect) that causes the motor to turn when
current is flowing.
This is the part of a motor that contains the main current-carrying
coils or windings. For DC motors this is the rotor, but for AC motors it
is usually the stator.
In a DC motor, the rotor consists of coils of wire wound around
a laminated iron frame. The frame is attached to an axle or shaft that
allows it to rotate. The iron frame is laminated to reduce heating and
losses due to eddy currents. The iron itself acts to intensify the
magnetic field running through the coils.
Stationary permanent magnets (electromagnets in large DC motors)
that provide an external magnetic field around the coils. Permanent
magnets are curved to maximise the amount of time the sides of the
coil are travelling perpendicular to the magnetic field, to maintain
maximum torque.
A device with metal semicircular contacts that reverse the direction of
the current flowing in each coil at every half rotation. This reversal of
the current makes continuing rotation possible in a DC motor.
Conducting contacts (generally graphite or metal) that connect the
commutator to the DC power source.
Armature
Rotor
Stator
Commutator split-ring
Commutator brushes
Brushless DC motors
N
S
N
Now that we have seen the basic structure of a DC motor, we need to turn our
attention to understanding and explaining its operation. In section 4.3 we
encountered the motor effect, but now we will apply it to rotating a currentcarrying coil. Figure 6.1.1 shows that motors contain many coils of wire. For
simplicity we will begin by looking at a single coil and how it would act as a motor.
115
Motors: magnetic
fields make the
world go around
pivot
where torque in newton metres (Nm) is given by the distance d from the pivot
If the force is not acting
in metres multiplied by the force F in newtons.
at right angles to the see-saw (Figure 6.1.3b), it is the perpendicular component
F of the force that is used. As F = Fcos, the formula for torque becomes:
= Fd = Fcosd
b
d
F
cos =
F
Notice that the angle between F and F is the same as the angle the
see-saw makes with the horizontal. We will use this fact again soon. Note that
you could also use = Fd.
Worked example
QUESTION
axis of rotation
FYZ
FWX
Calculate the torque exerted on the see-saw in Figure 6.1.3b if a force of 980N was acting
at a distance of 4.0 m from the pivot at an angle of 30.
SOLUTION
= Fd = Fcosd
= 3390 Nm
= F d
b
FWX
WX
rotation
axis of rotation
YZ
FYZ
We will use the wire between points W and X as our example and consider
the situation in Figure 6.1.4b in which the coil has rotated. We must now
account for the fact that force FWX is not acting perpendicular to the coil, so we
use the formula seen previously:
= Fd = FWXcosd
As the force FWX is the motor effect, we recall the equation:
FWX = BIlsin
Note that in this equation is the angle we met in section 4.3. It is the
angle of a current-carrying wire to the magnetic field through which it passes.
This is not the same angle as in the cos term of the previous equation. Since the
motors and
generators
two sides of the coil that experience a force (wx and yz in Figure 6.1.4) are
always perpendicular to the field, = 90. This makes sin = 1 and therefore:
FWX = BIl
Combining this equation with = FWXcosd (using the other !) gives:
= BIlcosd
where l is the length W to X (WX) and d is half the length of X to Y (XY/2).
Now we consider the whole coil. The two forces FWX and FYZ are equal and
both make the coil rotate in the same direction. The total torque on the coil is:
total = 2BIlcosd
The product 2ld is the area of the coil A. This gives:
= BIAcos
Motors have more than one coil, so we make one final adjustment to this
expression, giving:
= nBIAcos
where n is the number of turns in the coil.
This formula quantifies the
torque (turning effect) on a rotating coil in newton metres (Nm). It is important
to note that the angle in this equation is the angle of the coil relative to the
magnetic field, as shown in Figure 6.1.4b.
Worked example
QUESTION
axis of
rotation
The coil in Figure 6.1.5 contains 50 turns and is carrying a 6.0 102 A current in a
5.0 103 T magnetic field. Calculate the torque on the coil if the coil has rotated 30
relative to the field.
= 4.5 105 Nm
cm
cm
SOLUTION
Using = nBIAcos and converting the lengths to metres:
= 50 5.0 103 6.0 x 102 (0.06 0.05) cos30
F
B
Figure 6.1.5
PRACTICAL
EXPERIENCES
Activity 6.2
= nBIAcos
117
Motors: magnetic
fields make the
world go around
In this example the values of all the variables on the right-hand side of the
The variation in the cos
equation remain constant, except for the angle .
term accounts for the variation in the magnitude of the perpendicular force F
experienced by the sides WX and YZ. This force determines the amount of torque
acting on the coil, and therefore the torque varies accordingly as the coil rotates.
At position 1 in Figure 6.1.7b, the perpendicular force F on the coil is at a
maximum and therefore the torque is at a maximum. This occurs when is
zero and cos is 1.
At position 2, the coil has rotated 45 with respect to the magnetic field. The
magnitude of the perpendicular force (F) acting on the coil is less than at
position 1. This means that the torque acting on the coil has reduced. At this
point, cos is approximately 0.7. Note that real motors have curved magnets
in the stator to ensure that each coil maintains = 0 as long as possible. This
provides maximum torque for the longest time possible during each rotation.
In the example in Figure 6.1.7, the magnets in the stator are not curved, so
only at positions 1 and 5 is = 0.
At position 3, the perpendicular component of the force F has dropped to
zero. Consequently, there is no torque acting on the coil at this instant. Here
is 90 and cos is 0, so torque is zero. At this point the brushes and splitring
within the commutator have broken contact momentarily and no current
flows in the coil. The current needs to be reversed in the coil at this point, so
that the motor continues to turn for the next half rotation.
Between position 3 and 4, the commutator has reconnected the coil and
reversed the direction of the current flowing in the coil (now flowing through
points Z, Y, X and W in turn). The opposite sides of the split-ring
commutator are now connected to the terminals of the battery. If the current
had remained in the original direction once the motor rotated past point 3,
the forces on the coil would have been in the opposite direction to the initial
rotation. This would stop further rotation.
As the motor moves through position 4, the perpendicular component of the
force F is increasing. This means the torque is increasing and will reach a
Try this!
Model a simple motor
a
X
F
YZ
WX
c
Position
= 0
WX
Torque
0 max.
YZ
F
= 45
WX
= 90
YZ
F
F
YZ
F
YZ
WX
F
= 45
WX
F
= 90
Time
motors and
generators
maximum again at position 5. At position 5 the coil has performed half a
rotation and the pattern we have seen from positions 1 to 5 will repeat until
the coil is back to its original orientation at position 1.
PRACTICAL
EXPERIENCES
Activity 6.1
PHYSICS FEATURE
Galvanometers
pointer
force
permanent magnet
spring
coil
pivot
moving
coil
soft iron
core
magnetic field
Checkpoint 6.1
1
2
3
Motors: magnetic
fields make the
world go around
B
Current (A)
motor current
0
0
Time (ms)
Try this!
a back emf
Connect a DC motor to a
battery and place an ammeter
in the circuit to measure the
current flowing through the
motor. Predict how the amount
of current flowing will change
if you apply a significant load
on the motor. Try it! How did
you go?
120
We can see that this changing flux B over time t would generate a
potential difference or emf .
This potential difference (back emf ) is in the opposite direction to
the applied potential difference (supply emf) that causes the rotor to turn. As
the speed of the motor increases the back emf increases, as B/t increases.
Eventually this potential difference cancels out most of the applied potential
difference and virtually no current flows through the coil. At this point there
is no net torque acting on the rotor and it turns at a constant speed.
Let us analyse this situation using Figure 6.2.1. At time A on the graph,
the motor is connected to a DC power source. The applied potential
difference causes a current (blue line) to flow in the motors coil and this
builds quickly to its maximum value. Once the coil starts turning, a back emf
is generated due to the changing flux within the coil. This back emf is in the
opposite direction to the applied potential difference and therefore reduces
the net potential difference, which in turn reduces the current flowing in the
coil as the speed of rotation increases.
At time B, the motor has reached its maximum rotational speed. Here
most of the supply emf has been cancelled out by the back emf. If the motor
has no load attached to it, only a small current continues to flow. This residual
current is required to overcome any friction within the motor and any voltage
drop due to losses such as resistive heating. There is no net torque on the coil
between times B and C and the motor operates at a constant speed.
motors and
generators
At time C, a large load is applied to the motor, such as the motor turning
a wheel to move a toy car. The motor slows down quickly under this load and
the amount of back emf is reduced. This means that the applied potential
difference is greater and therefore a larger current flows through the coil.
At time D, a larger current continues to flow through the coils. If the
motor is not designed to handle the resistive heating produced by this larger
current, the motor may burn out. Burn out occurs when insulation melts at
high temperature, and may cause other components to melt. The motor will
then cease to operate efficiently.
Back emf
measures
motor speed
Checkpoint 6.2
1
2
3
active
neutral
earth
Current
+
0
Time
121
Motors: magnetic
fields make the
world go around
AC motors have the same basic components as DC motors but lack the
need for others. They all contain a rotor and stator. Their magnetic fields can be
generated by current-carrying coils. They utilise the motor effect to transform
electrical potential energy into rotational kinetic energy. Lets explore examples
of the AC induction motor that exist both in industry and in the home.
electromagnets
rotor
Universal motors
armature coils
carbon brushes
shaft
stator coils
segmented
commutator
spring-loaded
brush holders
Figure 6.3.4 A typical universal electric motor, showing the main components.
Some motors would have additional stator coils. The commutator
feeds current to the armature coils in the position where most
torque will be experienced.
122
motors and
generators
a
conducting bars
Current
+
0
Time
i
rotor
ii
iii
2
end ring
Figure 6.3.5 (a) In a three-phase motor, as the current in each pair of opposite
coils peaks, the field appears to rotate, dragging the rotor around
with it. (b) A squirrel cage rotor. The rotor is made of iron
laminations to cut down undesirable eddy currents. The induced
currents flow lengthwise in copper or aluminium rods which are
joined at the ends (as in a squirrel cage).
stator pole
These motors contain a squirrel cage rotor (Figure 6.3.5b) that does not
Around the circumference of these
require the input of an external current.
rotors are a number of parallel conducting bars. These bars are joined at the ends
by an end ring that allows current to flow from one bar to another. The rotating
stator field induces a current in these bars and a magnetic field is induced in
accordance with Lenzs law. This induced magnetic field interacts with the rotating
field from the stator and the resulting forces cause a torque on the rotor. This
causes the rotor to spin without the need for a commutator as in a DC motor.
As these motors do not need brushes and therefore have fewer moving parts, they
are more efficient and more reliable.
Another way to understand the operation of an AC induction motor is to
consider a positive particle within one of the conducting bars in the stator.
Lets consider the bar marked A in Figure 6.3.6a. As the rotating magnetic field
moves upwards past bar A this is equivalent to the bar moving downwards in a
stationary magnetic field. Figure 6.3.6b shows this equivalent situation in which
bar A moves relative to the magnetic field. Using the right-hand palm rule
(see section 4.2) we see that a positive particle in bar A would experience a force
into the page. This is equivalent to a current being induced into the page in bar A.
Now we must use the right-hand palm rule (see section 4.3) to deduce the direction
of the motor effect on a current-carrying conductor. This indicates that a force is
exerted upwards on bar A and this is in the same direction as the rotating field in
Figure 6.3.6a. This force on bar A is the same as the force experienced by each bar
as the magnetic field rotates. These forces, in the same direction as the rotating
stator field, exert a torque on the rotor and are responsible for its rotation.
a
stator
B
squirrel cage
conducting bars
rotating
magnetic
field
B
I
motion of bar A
relative to the
magnetic field
Motors: magnetic
fields make the
world go around
Try this!
Induction exerts forces
Suspend a small piece of
a lightweight conductor (e.g.
aluminium foil) from fishing line.
Use your right-hand palm rule to
predict the force acting on the foil
as you move a strong magnet
vertically past the foil. Can you
observe any movement? If not,
use the scientific method to
discover why its not working
and try again.
conducting bar
rotor
shading rings delay
the phase of part of
the field to produce
a rotating field
thick, copper
shading ring
Figure 6.3.7 (a) A shaded-pole AC induction motor taken from a small household fan, with
(b) a cross-section of the rotor. The conducting bars can be seen clearly within
the laminated rotor. (c) The principle of a simple single-phase, shaded pole
induction motor. The distorted (or shaded) field causes the rotor to turn in
one direction in preference to the other.
124
motors and
generators
Advantages
Disadvantages
Common applications
Simple (brushed) DC
motor
Efficiency (%)
4090
Brushless DC motor
3090
AC universal motor
4060
Three-phase AC
induction motor
7090
2035
Checkpoint 6.3
1
2
3
125
Motors: magnetic
fields make the
world go around
PRACTICAL EXPERIENCES
CHAPTER 6
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Discussion questions
1 Outline how the motor effect is used to make music in a loudspeaker.
2 Explain how a loudspeaker differs from a motor in its use of the
motor effect.
3 Determine the difference between the way in which the motor effect
is used in a loudspeaker and in a galvanometer.
armature
Part A: Make a loudspeaker and determine how the motor effect is used to make
it work.
Equipment: two horseshoe magnets, strong sticky tape, thin insulated wire,
cardboard, power supply, alligator clips and wires.
Part B: Make a working galvanometer and determine the differences in this
application of the motor effect.
Equipment: PVC-covered copper wire (150 cm) with bare ends, wooden base
board, armature block, magnets, split pins, knitting needle, rivets, wire
strippers, drinking straw, rheostat (1015 ohms, rated at 5 A or more).
brush
Make a motor like the one shown and note what factors change its performance.
Calculate the torque of your motor.
Equipment: insulated wire, magnets, magnetic field sensor and data logger
(if available), paperclips, Blu-Tack, connecting wires with alligator clips,
power supply.
Discussion questions
1 Investigate the factors that determine the effectiveness of the motor.
2 Calculate the amount of torque in your motor and list ways in which
torque can be increased.
Using the equipment supplied, make a model of an AC induction motor and relate
each part to the parts in a real AC motor.
Equipment: aluminium foil, fishing line, retort stand and clamp, ceramic
magnet.
Discussion questions
1 Outline how the metal is made to move.
2 Explain why the AC induction motor is so efficient.
126
Chapter summary
motors and
generators
Review questions
Physically Speaking
Reviewing
Split-ring commutator
Squirrel cage
Brushes
Stator
Fan
Bearings
Armature
Magnets
D
E
Figure 6.4.2
127
Motors: magnetic
fields make the
world go around
D
A
motor
motor shaft
r
edge of table
string
mass
Figure 6.4.3
11
12
13
14
Figure 6.4.4
solving Problems
16 Calculate the maximum torque that is generated by a
force of 460N applied to an object at a distance of
3m from its axis of rotation.
Re
128
Q uesti o
a
b
c
d
iew
= F d = mg d
Mass = 0.5kg
Torque (Nm)
Current a
Radius (m)
0.27
0.1
0.055
0.54
0.2
0.11
0.81
0.3
0.165
1.08
0.4
0.22
1.35
0.5
0.276
1.62
0.6
0.331
motors and
generators
PHYSICS FOCUS
Linear motors
Research
8 Find out exactly how a linear motor works.
9 Compare the torque produced in a standard motor
with the linear force in a linear motor.
129
Generators and
electricity supply:
power for the
people
Technology that changed our lives
Permanent magnets
or electromagnets
provide an external
magnetic field.
In chapter 6, we saw that the real value of electric motors was that they convert
electrical potential energy into rotational kinetic or mechanical energy. A logical
question that follows is how do we get the electrical potential energy that turns
these motors?. For many of our household applications, the answer is we use
a generator. When you switch on an electrical appliance at home, you may not
realise where the energy comes from. In some distant power plant there is a huge
rotating machine as big as a building that looks like a huge electric motor. The
difference is that this generator is producing electricity instead of using it.
The simplest AC motor design is the synchronous AC
motor (Figure 7.1.1). If this motor was supplied with 50Hz
AC from the power point, it would spin at 50 revolutions
per second in synchronisation with the electrical signal. This
S
design has no practical application, but more complicated
designs are used in clocks and tape drives, due to their
constant speed.
AC supply
Slip-ring commutator continuously
connects the rotating coil to the AC supply.
130
motors and
generators
Although it lacks applications as a motor, this design would produce an
electric current if you turned the coil with your hands. In this way it is acting as
When we turn the motor ourselves, its coil experiences a
a generator.
changing magnetic field and an emf is induced (recall Faradays law). If the loop
is connected by a circuit, a current flows, and we have turned the motor into
a generator.
The parts of a generator are essentially the same as those of a motor (see
Table 7.1.1). The difference is that we physically turn a generator and it produces
electrical potential energy. So the operation of a generator is the opposite of that
of a motor. In this section we will only consider simple generators; we will see a
more complicated version in section 7.3.
Armature
The armature is the part of a generator that contains current-carrying coils. These carry an induced
current caused by a changing magnetic field. For simple generators these are the coils in the rotor, but
in other generators the armature is in the stator.
These are the many loops of wire that carry electrical current. In many generators there are two sets of
coils. One set is the electromagnets that provide the magnetic field (in simple generators this field is
provided by permanent magnets). The other set is in the armature and these electromagnets carry the
current produced by the generator.
The rotor generally consists of coils of wire wound around a laminated iron frame. The frame is attached
to an axle or shaft that allows it to rotate. The iron frame is laminated to reduce heating and losses due
to eddy currents. The iron itself acts to intensify the magnetic field passing through the coils.
In simple generators the stator is the stationary permanent magnets or electromagnets that provide an
external magnetic field around the rotor. These magnets are curved to maximise the amount of time the
sides of the rotor coil are travelling perpendicular to the magnetic field. In some generators the stator
contains the armature coils and a magnet turns as part of the rotor to produce a changing magnetic field.
A simple DC generator contains semicircular metal contacts (a split ring) that reverse the direction of the
current flowing out of the rotor coil every half rotation. This reversal of the current ensures that the
current being produced is DC.
A simple AC generator has two circular metal contacts. Each slip ring is connected to one end of the
coils in the rotor. These provide an alternating current that changes direction every half rotation. In more
complicated examples these provide a current to an electromagnet in the rotor and a current is produced
in the stator.
Brushes are conducting contacts (generally of graphite or metal) that connect the commutator to the
external circuit.
Coils
Rotor
Stator
Commutator
split ring
Commutator
slip ring
Commutator
brushes
A simple AC generator
PRACTICAL
EXPERIENCES
Activity 7.1
Generators and
electricity supply:
power for the people
slip-ring commutator
to produce AC
slip-ring commutator
to produce DC
S
coils or
windings
rotor and
armature
stator
magnets
output terminals
Figure 7.1.2 (a) A typical AC and DC generator found in high schools and (b) a close-up
showing the main parts of this generator (see Table 7.1.1 for details)
At position 1 (Figure 7.1.3b), the plane of the coil WXYZ lies parallel to the
magnetic field. This means that no magnetic field lines are passing within
the coil, so the amount of magnetic flux B within the coil is zero. Recall
Faradays law:
Try This!
Modelling generators
Use the model of a simple
motor suggested in section 6.1
to model the operation of a
generator. Add slip-ring and
split-ring commutators by
sticking pieces of aluminium
foil around the cork. Ensure
you can describe and explain
the current that would be
produced during a full rotation.
ractiv
nte
M o d u le
132
= n(B/t)
which shows that the emf induced in the coil is proportional to the rate of
change in magnetic flux B/t passing within the coilnot the flux value
itself. At position 1, the emf is at a maximum because the rate of change of the
magnetic flux is at its maximum. This can be seen in Figure 7.1.3c in which
the tangents to the curve (illustrating the slope of the curve) for the magnetic
flux B represent the rate of change of magnetic flux B/t. The tangent at
position 1 is at the maximum positive value and, therefore, so is the induced
emf, according to Faradays law.
If the output terminals of the generator are connected, the induced emf
in the coil will cause a current to flow. At position 1, this occurs in the
direction marked (i.e. through Z, Y, X and W in turn). To determine the
direction of this current, lets consider a positive charge in the wire between
points W and X at position A. The right-hand palm rule shows that a charge
moving upwards at position A, as the coil rotates within the magnetic field,
will experience a force towards point W. You could also determine the direction
of the current using Lenzs law. Review Lenzs law and then give it a try.
At position 2 the coil has rotated 45. The amount of flux passing through the
coil has increased from zero and will continue to increase until position 3. The
rate of change of magnetic flux B/t at position 2 has decreased and will
continue to decrease until the coil reaches position 3. Around position 3 the
flux is changing relatively slowly. At position 3 the flux reaches a maximum
and then starts to decrease. Since the rate of change of flux B/t is zero
(as shown by the slope of the tangent) at position 3, the emf must also be
zero according to Faradays law. This means there will be no current flowing
through the coil at this point.
b
a
rotation
X
B
A
W
Position
WX
YZ
YZ
YZ
+max.
WX
and B
max. 0
YZ
WX
motors and
generators
WX
tangents show
rate of change
WX
YZ
YZ
WX
YZ
7
WX
YZ
8
axis of
rotation
output terminals
WX
WX
YZ
Time
Figure 7.1.3 (a) A simplified model of an AC generator and (b) the coil as seen from point P in part (a), showing the direction
of the current flowing over one rotation. (c) A graph of magnetic flux and induced emf over one rotation
Once the coil rotates past point 3, the flux B begins to decrease and the rate
of change of flux B/t is increasing. We can see this as the slope of the
curve for B gets bigger (as shown by a tangent to the curve). When the flux
B is decreasing, B=BfinalBinitial is a negative value and so is
B/t. This means that according to Faradays law the induced emf now has
the opposite sign to the sign it had before position 3. This means that the
current produced by this emf is in the opposite direction to the one that
would have flowed before position 3. This explains why we get an AC current,
because at positions 3 and 7 in Figure 7.1.3b the current will change direction.
Between positions 3 and 5 the flux decreases and is zero at position 5. The rate
of change of flux increases to a maximum at position 5. This means that the
emf increases to a maximum at point 5 and, therefore, so does the induced
current. Notice that in Figure 7.1.3b the current flowing along the side of the
coil WX is now flowing into the page. You can use the right-hand palm rule to
show that the current is now flowing through points W, X, Y and Z in turn.
This further illustrates that the current has reversed direction at point 3 and
we are producing an AC current at the output terminals.
Follow the rotation through to position 9 and you will see that the current
changes direction again at position 7 and the full cycle is completed at
position 9. Each rotation induces an emf in the coil that is in one direction
for half a rotation and in the opposite direction for the other half of a rotation.
This alternating emf can produce an alternating current and our explanation
of the operation of AC generators is complete.
A simple DC generator
Now that we have seen how an AC generator works, lets look at a DC generator.
Figure 7.1.5a shows a simple model of a DC generator. The obvious
difference is the split-ring commutator connecting the coil to the external circuit.
We have seen this situation in a simple DC motor (see Figure 6.1.7) but now
it is acting as a DC generator. Note that the graph for emf is the emf that would
be measured at the output terminals. This is therefore affected by the inclusion
of a split-ring commutator that is fundamental to the functioning of a DC
generator. Lets focus on what is different about the operation of a DC generator.
A Linear
Generator
magnet
motion of magnet
coil
133
Generators and
electricity supply:
power for the people
As before, we see that the change in the rate of change of magnetic flux
B/t in the coil causes a changing induced emf (Figure 7.1.5c). The
operation of this generator is the same as the AC example until we get to
position 3. At position 3 the brushes of the commutator reverse the contact
between the coil and the external circuit.
The current flowing in the coil WXYZ is in the same direction as for an AC
generator (see Figure 7.1.3b). The difference is that now this current only
flows in one direction in the external circuit. As the current in the coil
changes direction at position 3, the split-ring commutator reverses the
connection. This causes the current to flow in the same direction in the
external circuit throughout the entire rotation.
From the graph of emf in Figure 7.1.5c you can see that the emf induced in
the external circuit rises and falls but is always in the same direction. This
produces a current that also rises and falls and is always in the same direction.
This is DC. We usually see a DC (or emf ) represented on a graph as a
straight line with one value (e.g. the red line in Figure 4.1.2). This might be
the case with a current from a battery, but not for the output direct from a
DC generator.
At position 7 we see that again the split-ring commutator reverses the contact
to the external circuit. A current in this external circuit would continue to
flow in the same direction, even though the current in the coil again changes
direction.
Regenerative
braking
n energy-efficient way to
slow a moving vehicle is to
convert the kinetic energy of the
vehicle into electricity. This
energy can then be stored and
reused to get the vehicle moving
again. Traditionally braking was
achieved by using friction and
the energy was lost as heat.
Modern trains and electric hybrid
automobiles now use these
systems to increase efficiency.
The electric motors on the
wheels of the vehicle are used as
generators when the brakes are
applied, and energy is stored in
batteries or a capacitor.
Position
b
1
2
a
X
S
axis of
rotation
+max.
WX
YZ
YZ
tangents show
rate of change
B
WX
YZ
Z
WX
Y
and B
max.
YZ
WX
WX
YZ
YZ
WX
YZ
7
output terminals
WX
YZ
8
WX
WX
YZ
Time
Figure 7.1.5 (a) A simplified model of an DC generator and (b) of the coil as seen from point P in part (a), showing one complete rotation.
(c) A graph of magnetic flux and induced emf over this rotation
134
motors and
generators
PHYSICS FEATURE
Comparing motors
and generators
mechanical
energy output
motor
135
Generators and
electricity supply:
power for the people
Checkpoint 7.1
1
2
3
7.2 Transformers
Discuss why some electrical
appliances in the home that
are connected to the mains
domestic power supply use
a transformer.
Describe the purpose of
transformers in electrical
circuits.
Our everyday use of electricity requires a large variety of electrical currents and
voltages. With DC, this usually means that we have the right battery for the job.
With AC, its another story altogether. When you plug in your laptop computer,
it typically requires an AC input of about 18.5V, but the average value of the
voltage from the power point is about 240 V. (Note: the Australian standard is
now 230 V.) The conversion of AC to the correct voltage occurs, with a range of
+10% to 6%, in a small box on the laptops power cable, which contains a
device called a transformer.
changing magnetic flux
Try This!
exploring
electromagnetism
To observe the magnetic flux
produced by a changing current,
place a wire with many loops near or
wrapped around a magnetic compass.
Connect a battery to your coil and
observe the compass needle as you
connect and disconnect the circuit.
340
200
22
32
0 280 3
0 26
00
24
14
40
20
160
60
80
100
12
136
AC supply
voltage Vp
voltage Vs
primary coil
with np turns
load
secondary coil
with ns turns
Figure 7.2.1 In an ideal transformer, the iron core ensures that all the flux generated
in the primary coil also passes through the secondary coil.
motors and
generators
Lets consider applying a 50Hz input voltage Vp across the primary coil of
the transformer in Figure 7.2.1, which has np turns (or loops) of wire. If we
assume there are no losses, this will produce the same changing magnetic flux
B/t within both coils. This changing magnetic flux will produce a 50Hz
alternating voltage Vs across the secondary coil with ns turns. Substituting into
Faradays law gives:
B
B
Vp = np
and Vs = ns
t
t
np
ns
This expression shows us the relationship between the number of turns and
the voltages across each coil of a transformer.
If we analyse the expression above, we can see that if the primary coil has
more turns (loops) than the secondary coil (i.e. np/ns>1), the voltage Vp across
the primary coil will be greater than the voltage Vs across the secondary coil.
This is called a step-down transformer because it produces AC with a
lower voltage. When the secondary coil has more turns than the primary coil (i.e.
np/ns < 1), the voltage Vs measured across the secondary coil is higher than the
This is called a step-up transformer because it produces
input voltage Vp.
an AC output with higher voltage. Figure 7.2.3 is a simplified diagram showing
the change in magnitude of the output voltage Vs compared to the input voltage
Vp for these two types of transformers.
Vs
Worked example
Figure 7.2.3 The simplified graphs show
QUESTION
Calculate the number of turns required in the secondary coil of a transformer to produce
18.5V AC from 230V AC if there are 100 turns in the primary coil.
SOLUTION
Using:
Vp
Vs
np
ns
V
ns = np s
Vp
18.5
Substituting Vp = 230 V, Vs = 18.5V and np = 100 gives: ns = 100
230
= 8 turns
and rearranging to make ns the subject gives:
Vp
Vs
np
ns
137
Generators and
electricity supply:
power for the people
Efficiency is often expressed as a percentage and can be determined using the
following equation:
useful power output
Efficiency (%) =
100
total power input
The losses of energy that occur within a transformer cause the useful electrical
power output to be less than the power input. We saw in Chapter 4 that power is
the rate at which energy is converted and can be determined using the following
relationships:
V2
P = IV, P = I2R and P =
R
where P is electrical power in watts (W), I is current in amps (A), V is potential
difference (or voltage) in volts (V) and R is resistance in ohms ().
In formulating the equation:
Vp np
=
Vs ns
we assume that there are no losses; that is, that transformers are ideal.
We will
continue this assumption whenever we do calculations for transformers, but it is
important to consider transformers in a more realistic way, to understand their
losses and design considerations.
Using the equation above, we can determine another relationship for a
transformer that illustrates the changes in current. In an ideal transformer the
power input in the primary coil (Pp) and the power output in the secondary coil
(Ps) would be:
Pp = IpVp and Ps = IsVs
Neon lights
138
This illustrates that both the voltage and the current output of a
transformer change with respect to the input. We have therefore gained another
insight into the workings of transformers. By inspecting both equations:
Vp np
I s np
=
and
=
I p ns
Vs ns
motors and
generators
As we already know, the voltage is changed in a transformer and, for the above
relationship to hold, the change in current must be inversely proportional to the
change in voltage. We have already shown that this is the case by deriving the two
expressions:
Vp np
I s np
=
and
=
I p ns
Vs ns
Combining these expressions gives:
I s Vp
=
I p Vs
This expression shows that current and voltage are inversely proportional.
For example, in a step-up transformer Vs is larger than Vp so Vp/Vs must be less
than 1. In this case, Is/Ip is also less than 1. This means that Is is less than Ip.
We now see that transformers obey the law of conservation of energy
when they alter the current and voltage of AC electricity. For an ideal transformer,
the input power is equal to the output power and this is maintained by the inverse
relationships between input and output current and voltage.
These findings explain the design features evident in Figure 7.2.5, which
shows a step-down transformer. The increase in current in the secondary coil is
the reason a larger diameter wire is used for the secondary coil.
As discussed in Chapter 5, larger currents produce greater losses due to
resistive heating. The amount of heat Q produced by resistive heating is
proportional to the resistance of the material R and the square of the currentI
as shown by Joules law:
Q = Pt = I2Rt
where Q is in joules, power P is in watts or joules per second (Js1), I is in
amps, R is in ohms and t is in seconds (s). So we see that larger currents produce
more heat.
The resistance of a metallic conductor is described by the following equation:
l
R=
A
where is resistivity, l is length and A is cross-sectional area. This equation shows
that resistance in a wire is proportional to its length and inversely proportional to
its cross-sectional area. So, by increasing the diameter of the wire in the secondary
coil of the transformer (Figure 7.2.5), the resistance is decreased and this
minimises resistive heating.
resistive heating is one of the mechanisms
We have now identified that
responsible for energy losses in all transformers. The transformer shown in Figure
7.2.5 is a very common style of transformer in household appliances because it
minimises another loss mechanismflux leakage. The transformer in Figure
7.2.5 has a central iron core around which the primary and secondary coils are
The overall shape of the iron core acts to contain and direct the
wound.
magnetic flux from the primary coil through the secondary coil. This ensures that
the maximum amount of flux passes through the secondary coil and therefore
maximises induction in the secondary coil. If a significant amount of flux did not
pass through the secondary coil, then energy would not be transferred and could
be lost by other mechanisms.
secondary
coil
primary coil
139
Generators and
electricity supply:
power for the people
a
primary coil
increasing
magnetic
flux
secondary
coil
b
solid core
increasing
magnetic
flux
eddy currents
cross section
c
laminated
core
eddy current
cross section
The Sounds
of AC
140
low voltage
terminals
high voltage
terminals
oil tank
laminated core
low voltage coils
cooling
tubes
motors and
generators
Checkpoint 7.2
1
2
3
4
5
6
Identify a device that uses a transformer and outline the role the transformer plays in the operation of this device.
Define step-up and step-down transformers.
Explain why transformers do not work with DC electricity.
Calculate the voltage produced by a transformer for which np/ns is 0.01 and the input voltage is 12V.
Recall the law of conservation of energy and analyse its application to the operation of a transformer.
Identify the losses that occur in transformers and outline the design features that minimise these losses.
PRACTICAL
EXPERIENCES
Activity 7.2
141
Generators and
electricity supply:
power for the people
C
B
three phase
output
stator
N
A
S
neutral
Current or EMF
b
+
0V
B
10
20
30
40
Time
(ms)
142
motors and
generators
step-up transformers
GENERATION
POWER STATION
23 kV
50 Hz
330 kV or 500 kV
AC transmission
HIGH VOLTAGE
TRANSMISSION LINES
TRANSMISSION
TERMINAL
SUBSTATION
132 kV or 66 kV
AC transmission
step-down transformers
132 kV & 66 kV AC
SUB-TRANSMISSION
ZONE
SUBSTATION
11 kV
AC distribution
step-down transformers
11 kV AC
DISTRIBUTION
pole step-down
transformer
CONSUMER
household
single-phase
230 V AC
3 2
230 V
400 V
factory
three-phase 400 V AC
Generators and
electricity supply:
power for the people
insulator
Figure 7.3.6 A power pole in Marrickville, Sydney. Note that the four
main wires are not insulated. They are spaced far
enough apart to stop discharges occurring. They are
insulated from the pole by small ceramic insulators.
PHYSICS FEATURE
Transformers in the Home
In order to deliver electrical power efficiently across NSW the power loss between
The main
the generator and the customer must be as small as possible.
losses during transmission result from the resistance in the transmission wires and
the induction of eddy currents. First, lets look at losses due to resistance.
Using a relationship we have seen before:
P = I2R
we see that power loss P in transmission lines is proportional to the resistance R
and the current I squared. Therefore, if we double the resistance in transmission
wires we would double the power loss. But if we double the current we increase
So obviously, we want the resistance in
the power loss by a factor of four.
transmission wires as low as possible but, even more importantly, we want to use
a relatively small current. This is achieved using transformers to step-up AC to as
144
motors and
generators
high a voltage as practical. During this process the current is reduced to a
minimum.
Another way to illustrate why we want to reduce transmission current comes
from another relationship we saw earlier:
l
R=
A
where is resistivity, l is length and A is cross-sectional area.
This equation shows that the resistance R of a wire is proportional to its
length l. This means that, as the distance of transmission increases, the resistance
also increases. A typical resistance for one type of high-voltage transmission line
is about 0.4 ohms per kilometre. We can see that if large currents are used,
transmission will quickly become uneconomical over the hundreds of kilometres
typically required. A large proportion of the energy generated would be lost as
heat due to resistive losses. The best solution is again to minimise the current
being transmitted.
The two previous paragraphs highlight the importance of transformers in the
distribution of electricity. Transformers can readily step-up voltages and in doing
so they minimise the magnitude of the current involved. They can also step-down
the voltage when required to a value suitable for the consumer. This ability of
transformers has changed modern society enormously, and we will take a closer
look at this soon.
l
also shows us that the resistance of a wire is inversely
The equation R =
A
proportional to its cross-sectional area A. So making the diameter of the wire
larger will reduce its resistance. Unfortunately, thick copper wires are very heavy
and require larger structures to support them. Larger structures cost more money,
so a compromise is required. Aluminium is used as a conductor in high-voltage
transmission lines. Although it has a higher resistance than copper, it is much
lighter, so the wires have a larger diameter to reduce resistance without
being too heavy.
Losses during AC transmission also occur due to the formation of
eddy currents. The constantly alternating current in transmission lines
produces a constantly changing magnetic flux. This can induce eddy
To
currents in nearby conductors and energy will be lost.
minimise this, transmission lines are held at a distance from metal
transmission towers by insulators. Losses due to eddy currents also
occur in the cores of transformers. Recall that these losses are
minimised by constructing these cores from thin laminations of
magnetically soft iron.
PRACTICAL
EXPERIENCES
Activity 7.3
lightning protector
insulators
Transmission structures
A typical high-voltage AC transmission tower is shown in Figure
7.3.7. The three sets of wires carry the three phases generated in the
power station. As with lower voltage transmission (Figure 7.3.6),
high-voltage transmission lines are not coated with insulation.
These three sets of wires must be kept at large distances from
each other and from the metal tower. This not only minimises losses
due to eddy currents, it also minimises the chance of current
flowing through the air (electrical discharge) between conductors
or to the ground through the tower. In ideal conditions, 500kV
high voltage
transmission
lines
Generators and
electricity supply:
power for the people
Gather and analyse information
to identify how transmission
lines are:
insulated from supporting
structures
protected from lightning
strikes.
transmission lines must be approximately half a metre away from the tower that
supports them. In conditions in which the insulators are wet and covered in
pollution deposits, this distance must be greater. Typically, therefore, these wires
are held more than a metre from their supporting tower.
Figure 7.3.8 shows that insulators used for lines are typically long chains of
ceramic components. These components are disc shaped and have deep
corrugations on their underside. These features increase the distance any current
would have to travel in a discharge. The corrugations also minimise the chance
of pollution deposits settling on the surface and keeps some of the insulator dry
in wet weather.
High-Voltage DC Transmission
insulator
arcing horn
cement
ball
conductor
146
cup
porcelain
motors and
generators
city centre. It is not uncommon in modern society for people to commute
large distances to work.
People in most rural areas enjoy the same access to electricity as city dwellers.
This has improved their quality of life and allowed many people to remain in
country areas, who might have otherwise decided to move to the city.
The quality of life improved when electricity was affordable in every home.
This changed the way we live through the many appliances available. Many
of us find it hard to conceive a life without a refrigerator, television, DVD
player or computer.
Tasks performed by electrical machinery decreased the amount of unskilled
labour required and increased unemployment in certain parts of society.
PHYSICS FEATURE
Checkpoint 7.3
1
2
3
4
5
Explain how transformers allow the transmission of AC power over long distances.
Identify the role of transformers in household appliances.
Outline the energy losses in high-voltage transmission lines and the steps taken to minimise them.
Describe how transmission lines are insulated from supporting structures and protected from lightning strikes.
Assess the impact of transformers and AC generators on society and the environment. Justify your answer.
147
Generators and
electricity supply:
power for the people
PRACTICAL EXPERIENCES
CHAPTER 7
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Research how AC and DC generators work then place the information that you
have found into a table outlining the advantages and disadvantages of each. Use this
information to create a list of applications that would benefit from using each.
Discussion questions
1 Compare the structure of AC and DC motors.
2 List applications for which AC and DC generators are preferred and
explain your choice.
148
Chapter summary
motors and
generators
Review questions
Physically speaking
Copy and complete the table below by adding a definition for each key term.
Term
Definition
Step-up transformer
Generator armature
Generator
Substation
High-voltage insulator
149
Generators and
electricity supply:
power for the people
Reviewing
A
X
B
Look at Figure 7.4.1. Identify each of the parts that have been labelled.
3
4
Explain how you could tell the difference between an AC and DC motor.
Outline methods of reducing energy lost along the path from generation
to consumer.
a List the effects that generators have had on society and the environment.
b Assess these effects.
1
2
Create a table to compare and contrast the parts and function of an electric
motor and a generator.
a Identify where energy is lost during the distribution of AC from the time
the electricity is generated to when it reaches the household consumer.
b Describe the energy that is lost.
B
E
voltage Vp
voltage Vs
12
13
14 Show that Vp
Vs
15 Outline the energy transfers for a transformer that is not 100% efficient.
16 Determine how currents in primary and secondary coils of a transformer
are related.
18 Many appliances that are used in the home have a transformer attached to
their power cord. Explain the presence and need for these.
150
motors and
generators
19 Give examples of how transformers have impacted on the way in which
we live.
Solving Problems
22 Calculate the voltage in the secondary coil of a transformer that has 20 turns
in the primary coil and 300 turns in the secondary coil if the input is 240V.
100
320
240
50000
Coils in secondary
Step-up or step-down
200
50
30500
240
6
240
0.5
Current in secondary
coil (A)
Voltage in secondary
coil (V)
Turns ratio
240
2
12
0.1
1000
iew
Q uesti o
Voltage in primary
coil (V)
Re
Current in primary
coil (A)
151
Multiple choice
(1 mark each)
1 Two current-carrying wires of different lengths are
III
0.5 cm
152
Figure 7.5.1
0.8 m
IV
2m
Part
Role
a
b
Armature
Brushes
Commutator
Magnet
II
II
III
IV
Axis of
rotation
Force
Output
terminals
Current
Force
Current
Axis of
rotation
Output
terminals
Axis of
rotation
Output
terminals
Force
Current
Output
terminals
Axis of
rotation
Current
Force
motors and
generators
Extended questions
6
10
12
13
1 cm
0.5 A
1 cm
11
4c
4 cm
40
I
N
153
3
Context
Figure 8.0.1 Semiconductor integrated
circuits have produced laptop
computers that have much
more computing power than
even the most powerful early
military computers that used
thermionic devices.
154
from ideas to
Implementation
In just over 100 years we have gone from not knowing what constitutes matter to
understanding its subatomic structure. From the discovery of the mysterious
cathode rays came an understanding that these are fundamental particles that hold
solid matter together. Technology, such as X-rays, sprang from this discovery and
showed that their use in medical imaging enabled better patient diagnosis.
An understanding of the way electrons move in space and solids gave rise to the
plethora of electronic devices that have transformed our technology and way of life.
The control of electron motion in a vacuum resulted in early thermionic devices
(valves), such as the triode, to control the flow of current in electronic circuitry.
From these came television, radio, radar and the start of the modern electronic age.
The study of electron motion in solids resulted in miniature semiconductor devices
that replaced thermionic devices and enabled faster, more efficient and much more
sophisticated electronics, such as integrated circuits, to be made. These devices
are the backbone of all modern-day electronics, and include mobile phones and
computers, as well as electronics and instrumentation used in homes, hospitals,
industry, and in space.
The accidental discovery of superconductors (solid material through which
electrons travel unimpeded) may one day transform our technology yet again to
produce much faster computers and more efficient transportation, energy
production and transmission.
Inquiry activity
CDs: windows to viewing photons
Neon signs and most street lights use discharge tubes to create the many
different-coloured lights. In all of these, an electrical current is passed through
a gas at reduced pressure. The light from the glowing gas is characteristic of the
type of gas in the tube. A spectrometer is a device that separates the colours of
the spectrum of the gas by using the same principle of interference as early
research into X-ray diffraction. You can make your own spectrometer of visible
light at home simply by looking at the light reflected from the back of a CD
or DVD.
Ordinary light (from the Sun and an incandescent bulb) will show you the
continuous spectrum of white light that makes up the colours of the visible
spectrum. Wait until its dark and find lighting that does not use incandescent
lamps. Fluorescent tubes or bulbs are commonly available lighting that uses
electrical discharges. These are filled with mercury vapour and an inert gas.
1 Compare the spectrum of colours from a fluorescent lamp with that of an
incandescent lamp. Does it come in discrete colours or is it continuous like
the light from the incandescent lamp?
2 Use your spectrometer under the street lighting. Can you tell if this lighting
has the same gas as your fluorescent lamp or is it different? Or does it use
a mixture of gases?
155
From
cathode rays
to television
Mysterious rays
Try This!
Human-powered lighting
You can make a fluorescent light
tube temporarily give out flashes
of light by rubbing it quickly
with a piece of fur or wool in
a darkened room.
from ideas to
Implementation
near vacuum
We now know that this green glow is caused by electrons leaving the
cathode and striking the glass at high speeds, causing it to glow, but during
Geissler and Plckers time this was a complete mystery. The name cathode rays
was first used by Eugene Goldstein, a German physicist, who coined the name
because it appeared that unknown beams were being emitted by the cathode.
The tubes displaying this property came to be known as cathode ray tubes.
The next great challenge for physicists at the time was to find the composition
of these cathode rays from their behaviour.
cathode
anode
+
glass tube
switch
high voltage
(variable)
anode
mask holder
cathode
collimator
S
anode
magnet
157
From
cathode rays
to television
PHYSICS FEATURE
Anatomy of a discharge tube
anode dark
Aston
Crookes
space
nergy-efficient lighting, such as fluorescent
Faraday
tubes and neon signs, are known as discharge
+
tubes. These are evacuated glass vessels filled with
negative
gas at approximately 1% of atmospheric pressure
cathode
positive
anode
glow
and two electrodes at opposite ends of the tube.
glow
column
glow
A large potential difference between these
electrodes causes an electrical current to flow
Figure 8.1.5 A discharge tube showing the typical bright and
dark spaces
through the gas in the tube, resulting in colours
that depend on the type of gas and its pressure.
Bright regions in the discharge are areas where the
Figure 8.1.5 illustrates an operating discharge
electrons have sufficient energy to ionise the gas.
tube with the characteristic bright and dark spaces.
Some of the ions recombine with electrons, which
The Aston dark space next to the cathode is very thin
results in the emission of light. Not all atoms will be
and may go unnoticed. This is followed by the first
ionised. Some will simply have their electrons gain
luminous region called the cathode glow. Next is the
energy while remaining bound to the atom or
Crookes dark space and the negative glow (a luminous
molecule; this is known as excitation. All excited
region). The Faraday dark space follows this and is the
electrons will fall back to their previous energy state
largest of all the dark spaces. The largest luminous
and, in doing so, give out light. A minimum energy is
region, known as the positive column, follows and is
required to excite or ionise an atom or molecule.
the most prominent feature of the discharge. The
Many discharge characteristics are related to the
positive column may display periodic regions of bright
average
distance that electrons travel between
and dark spaces known as striations. Finally, there
colliding with the gas atoms or molecules. An increase
is the anode dark space and then the anode glow
in gas pressure leads to an increase in the density of
adjacent to the anode.
gas particles, and results in an electron travelling a
An operating discharge tube consists of a mixture
shorter distance between collisions, and vice versa.
of ions, electrons and neutral gas atoms. The
At low pressure, electrons have sufficient distance
discharge starts because energetic particles (such as
to accelerate and reach the required ionisation energy;
electrons and protons) continually stream from outside
however, they may strike the sides of the vessel more
the Earth and strike the surface and atmosphere.
often than they strike the gas. A minimum number of
This so called cosmic radiation can strike gas atoms
ionisations are required to sustain the avalanche of
and produce free ions and electrons in the discharge
electrons that maintain the discharge. Below a
tube. An electric field causes the free electrons to
minimum pressure, it becomes difficult to start an
gain sufficient energy to ionise gas atoms and
electrical discharge along the tube.
produce further free ions and electrons, which in
turn will accelerate to produce more ionisation. This
avalanche of ionisation is the way an electrical
+ Figure 8.1.6 Ions that strike the
discharge is started.
cathode liberate
e
the discharge.
continually ejected from the cathode surface by the
+
bombardment of ions that are attracted to it. These
secondary electrons cause further ionisation, which
PRACTICAL
results in further ion bombardment of the cathode
EXPERIEN
CES
surface, and the process repeats.
Activity 8.1
158
from ideas to
Implementation
159
From
cathode rays
to television
cathode
anode
collimator
paddle wheel
+ + ++ +
+
cathode rays are not deflected,
and behave in the same way as light
Checkpoint 8.1
1
2
3
4
5
6
7
8
160
PRACTICAL
EXPERIENCES
Activity 8.2
from ideas to
Implementation
dimensions. The greatest repulsive or attractive force is in the region of the lines
with the closest spacingnear the charge in this case.
There are some guidelines for drawing electric field lines for two or more
charges:
The lines must begin on a positive charge and/or end on a negative charge.
Larger charges have more lines starting or ending on them.
Lines cannot cross.
Lines are always at right angles to a conducting surface.
The electric field lines for a pair of opposite and equal point charges are shown
in Figure 8.2.2. The number of field lines leaving the positive charge is equal to
the number of lines ending on the negative charge. The lines are radial at very
close distances to the charges. These lines are more closely spaced near the charges
to indicate higher electric field strength in that region.
The electric field lines between two equal charges q of the same sign are shown
in Figure 8.2.3. The region at the centre, between the charges, is where the
electric field strength is zero, because the electric fields of the two charges cancel
each other.
The field lines between two charged parallel metal plates of opposite sign but
Equally spaced and parallel
equal magnitude are shown in Figure 8.2.4.
lines indicate that the field is equal in magnitude and direction and is said to be
uniform. The field lines start to curve near the edges and become unevenly
spaced, indicating a non-uniform field (called the edge-effect).
b
negative
positive
Figure 8.2.3 Electric field lines around (a) two equal positive charges,
and (b) two equal negative charges
E
+ + + + + + + + + + + positive
161
From
cathode rays
to television
energy required to move an object in the direction of the force is known as work
and is defined as:
Work = force distance = W = F d
(see in2 Physics @ Preliminary section 4.3).
In this case, the displacement d is the separation of the parallel plates.
Rearranging this formula as follows:
W
F =
d
and substituting this expression for force in the expression for electric field given
previously, we obtain:
W
E=
qd
Recall also that the work done on charges is related to the potential difference
V by the following expression:
W=qV
(see in2 Physics @ Preliminary section 10.8).
Substituting this into the previous expression for electric field strength,
we obtain:
q V
E=
qd
E=
Thus, the electric field strength E is easily calculated from the plate separation
d and the potential difference between them V. From this equation, you can see
that an alternative unit for the electric field strength is volts per metre (V m1).
Worked example
Question
e
E
+ + + + + + + + + + +
L
V
d
Two parallel plates are separated by d = 1.0 cm and have a side length of L = 2.0 cm
(Figure 8.2.6). A potential difference of V=10V is applied between them.
An electron enters halfway between the plates with a velocity of v0 = 2.7 106 ms1
parallel to the plates.
Calculate:
a the electric field strength E between the plates
b the vertical force F experienced by the electron as it travels between the plates
c the acceleration of the electron
d the time it takes for the electron to travel the full length of the plates
e the vertical displacement of the electron just as it exits the plates
f the change in kinetic energy of the electron between its entry to and exit from
the plates.
Note that the electron charge e = 1.6 1019 C, and its mass me = 9.11 1031 kg.
Solution
a The electric field strength E is given by:
V
10
E= =
= 1000 V m1 (or N C1 )
d 0.010
162
from ideas to
Implementation
b The vertical force is given by:
F = e E = 1.6 1019 1000 = 1.6 1016 N upwards
c The acceleration can be obtained from Newtons second law (see section 3.4 of
in2 Physics @ Preliminary):
a=
F
1.6 1016
=
= 1.76 1014 m s2 = 1.8 1014 m s2 downwards
me 9.11 1031
d The motion of the electron is similar to projectile motion (see Module 1 Space).
There is only acceleration in the vertical direction. We are concerned with the
horizontal component of the velocity, which remains constant. Therefore, the time t
to travel the length L of the plates is given by:
t=
L
0.020
=
= 7.4 109 s
v 0 2.7 106
e The electron only accelerates vertically, so we use the acceleration from part c and
one of the SUVAT equations (see in2 Physics @ Preliminary section 1.3) to work out
the vertical displacement s given by:
1
s = ut + at 2
2
where u is the initial vertical velocity = 0 ms1, t is the time to travel the length of
the plates (from part d), and a is the vertical acceleration = 1.8 1014 ms2 (from
part c). Therefore, the displacement is given by:
1
s = 0 + 1.8 1014 (7.4 109 )2 = 0.0049 m = 4.9 mm
2
f The change in kinetic energy is due to the vertical deflection of the electron by the
electric field. This is work done by the field on the electron, and is given by:
W = e V
where V is the change in potential that the electron experiences due to its vertical
deflection s. The relationship between potential difference and the displacement s
is given by:
V = E s
Checkpoint 8.2
1
2
3
Explain the meaning of the arrows and spacing of lines when drawing electric field lines.
Calculate the electric field strength at the location of a charge of 1.28 1018 C that experiences a force of
1.1 1018N.
Sketch the electric field lines around:
a a point positive charge
b two oppositely charged parallel metal plates.
163
From
cathode rays
to television
r
F
+
+q
mv 2
= qvB sin
r
where r is the radius of curvature of the particle.
Rearranging this expression, we obtain the following expression for the radius:
mv
r=
qB sin
c
b
+
v sin
v
+
B
v
v cos
Figure 8.3.2 (a) A proton enters a magnetic field at an angle . (b) The component of the velocity at right angles to the field undergoes circular motion.
(c) The total motion is helical along the magnetic field.
164
from ideas to
Implementation
Worked example
Question
Protons are occasionally ejected from the Sun at high speeds towards the Earth. These can be
caught in the Earths magnetic field and are trapped along magnetic field lines in a region
known as the Van Allen belt. A proton travelling at a speed of v = 1.0 107 m s1 strikes the
Earths magnetic field at an angle of 30 at a distance of 3000 km above the surface where
the magnetic field B = 3.5 105 T. Assume that the Earths magnetic field at this distance
is uniform. The charge on a proton is 1.61019 C and it has a mass of 1.671027 kg.
a Calculate the magnitude of the force on the proton.
b Calculate the radius of curvature of the proton.
c Determine if the cyclotron motion of the proton will cause it to come in contact with
the Earth.
Solution
a The magnitude of the force is given by:
F = qvB sin = 1.6 1019 1.0 107 3.5 105 sin 30 = 2.8 10 17 N
mv
1.67 1027 1.0 107
= 6.0 103 m
=
qB sin 1.6 1019 3.5 105 sin 30
c No, the protons cyclotron motion will not cause it to come in contact with the Earths
surface, since the cyclotron radius is 6 km and the proton is 3000 km above the surface.
Checkpoint 8.3
1
2
Outline what will happen to a negative charge moving to the left at right angles to a magnetic field that is
directed out of the page.
Calculate the magnitude of the force experienced by an electron that travels at right angles to a magnetic field
of 2T at a speed of 3 ms1.
165
From
cathode rays
to television
magnetic field
electromagnet
cathode rays
large
voltage
fluorescent
screen
cathode
anodes
electromagnet
charged plates
The parallel plates deflected the beam vertically upwards by placing a positive
charge on the top plate. A magnetic field (using an electromagnet outside the tube)
was applied at right angles to the electric field and the direction of the beam. The
direction of the magnetic field was such that it deflected the beam downwards.
Adjusting the magnetic and electric forces such that they were equal and
opposite resulted in the beam passing through undeflected.
We now quantitatively describe this experiment and how it leads to the charge
to mass ratio of the electron.
Equating the electric to the magnetic force on a particle with a charge q and
speed v in a magnetic field B and an electric field E, we obtain the following
expression:
Magnetic force = electric force
qvB = qE
Thomson was able to calculate the radius of curvature r from the deflection of
the beam on the fluorescent screen when the electric field was switched off and
magnetic field switched on. He could calculate the magnitude of the electric field
E because he knew the spacing d of the plates and the potential difference
V between them, and made use of the relationship E=V/d. Finally, knowing the
number of turns in the wires of the electromagnet and the current flowing through
it, he was able to calculate the magnitude of the magnetic field B.
Thomson found that the charge to mass ratio always came to:
q
= 1.76 1011 C kg 1
m
regardless of the cathode material, indicating that a fundamental particle was
being emitted. This, in essence, marks the discovery of the electron. In 1891,
the Irish physicist George Johnstone Stoney (18261911) suggested that the
fundamental unit of electricity be called an electron6 years before Thomsons
publication of his now famous experiment.
166
from ideas to
Implementation
Checkpoint 8.4
1
2
167
From
cathode rays
to television
anode (positive)
electrons
'boil' off
the heated
cathode
collimator
electron
beam
heater
cathode (negative)
electrons attracted
to the positive anode
Figure 8.5.3 The components of an electron gun used in both cathode ray
oscilloscopes and CRT televisions
Time
Time
sinusoidal vertical voltage
Television
electron gun
electron
beam
magnetic
coils
fluorescent screen
168
Cathode ray tube (CRT) television sets used the principles of the cathode ray
tube for most of the 20th century. These are now being superseded by plasma
and liquid crystal display television sets, which use different operating principles
and allow a larger display area with a sharper image. However, the CRT
television holds quite a significant historical place in this form of
communication.
A schematic diagram of a colour CRT television set is shown in Figure 8.5.5.
Its basic elements are similar to those of the CRO. The main difference is the
method of deflecting the electrons. Magnetic field coils placed outside the tube
produce horizontal and vertical magnetic fields inside it. The magnitude and
direction of the current determine the degree and direction of electron beam
deflection. Recall your right-hand palm rule for the force on charged particles
in a magnetic field. The vertical magnetic field will deflect the electrons
horizontally; the horizontal field will deflect them vertically.
The picture on the screen is formed by scanning the beam from left to right
and top to bottom. The electronics in the television switches the beam on and
off at the appropriate spots on the screen in order to reproduce the transmitted
picture. However, to reproduce colour images, colour television sets need to
control the intensity of red, blue and green phosphors on the screen. Three
separate electron guns are used, each one aimed at one particular colour. The
coloured dots on the screen are clustered in groups of red, blue and green dots
that are very close to each other and generally cannot be distinguished by eye
without the aid of a magnifying glass. For this reason a method of guiding the
different electron beams to their respective coloured dots was devised. A metal
sheet, known as a shadow mask (Figure 8.5.6) and consisting of an array of
holes, is placed behind the phosphor screen. Each hole guides the three beams to
their respective coloured phosphor as the beams move horizontally and vertically.
Black and white television sets did not need the shadow mask since they had
only one beam.
from ideas to
Implementation
glass
deflecting
coils
mask
electron
guns
blue
beam
red
beam
R
G
B
vacuum
mask
phosphor dots
on screen
fluorescent
screen
green
beam
focusing
coils
fluorescent
screen
Try this!
Do not adjust your
horizontal!
If you have access to an old
black and white TV set or an old
style monochrome computer
monitor, try holding a bar
magnet near the front of the
screen and watch how the
image distorts. This occurs
because the magnetic field
deflects the electrons that strike
the screen. DO NOT do this with
a colour TV set. This can
magnetise the shadow mask and
cause permanent distortion of
the image and its colour. You
can move a bar magnet near the
back of a colour TV set to
deflect the electrons from the
electron gun and therefore
distort or shift the image
without causing permanent
damage to the TV set.
holes in
mask
electron
beams
Figure 8.5.6 A colour CRT television set has three electron guns
that will only strike their respective coloured
phosphor dots with the aid of a shadow mask.
Checkpoint 8.5
1
2
3
4
From
cathode rays
to television
PRACTICAL EXPERIENCES
CHAPTER 8
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Connect a set of discharge tubes that are at different pressures to an induction coil
or high voltage power supply and observe the different striation patterns. The
patterns are hard to see unless the room is very dark.
Equipment: induction coil, connecting wires, discharge tubes at different
pressures, DC power supply.
Discussion questions
1 Draw a labelled set of diagrams showing the distinct patterns that occur
during the evacuation of the tube.
2 Describe the change in the striation pattern with changing pressure.
170
Connect the induction coil or high voltage power supply to each of the cathode ray
tubes. Observe the behaviour of the electron beam under the influence of a
magnetic or an electric field. Note the result of placing a Maltese cross or a paddle
wheel in the path of the electron beam.
Equipment: induction coil, connecting wires, cathode ray tubes in the
following set-ups: Maltese cross, magnetic field (you can use a permanent
magnet), electric field, paddle wheel, DC power supply.
Discussion questions
1 List which experiment supported the idea that cathode rays were waves and
which supported the particle theory. State the observation that led to the
particle or wave conclusion.
2 Identify the properties of cathode rays that were determined from these
experiments.
Chapter summary
from ideas to
Implementation
Review questions
Physically Speaking
Reviewing
The items in the columns are not in their correct order. Copy
the table and match each of the definitions to the apparatus.
Apparatus
Definition
Oscilloscope
Discharge tube
CRT TV
Maltese cross
171
From
cathode rays
to television
solving Problems
CRT
a force of 3.2 1016N.
phosphor experience SED
electron
phosphor
electron
13 Calculate the work
done in order to move an object
gun
emitter
through 10cm with a force of 3N.
deflecting
yoke
spacer
100 V
10 mm
0V
Re
iew
172
Q uesti o
from ideas to
Implementation
PHYSICS FOCUS
Where to from here?
luminescence
black
matrix
colour filter
glass substrate
electrode
Va
phosphor
Va
field
emission
nanogap
scattering
several nm
173
Electromagnetic
radiation: particles
or waves?
The wave nature of light
PRACTICAL
EXPERIENCES
Activity 9.1
174
A suggestion about the nature of light waves came from the Scottish physicist
James Clerk Maxwell (18311879). From his studies of Michael Faradays
experiments with electricity and magnetism, he derived four fundamental
Remarkably, these
equations that linked electricity and magnetism.
equations predicted that oscillating electric charges should produce a wave that
travels through space at the speed of light. This wave consisted of oscillating
electric and magnetic fields at right angles to each other, and was called an
electromagnetic wave.
Although Maxwells equations did not conclusively prove that light was an
electromagnetic wave, he strongly suspected that it was, because the predicted
speed of these electromagnetic waves was the same as the speed of light.
Moreover, his equations predicted that all electromagnetic waves travelled at the
speed of light, regardless of their frequency. Sadly, he died at age 48 before his
theory could be tested experimentally.
In 1887 Heinrich Hertz (18571894) was the first to experimentally verify
Maxwells electromagnetic wave theory. Hertzs apparatus was essentially a radio
wave transmitter and receiver. Today, almost all of us carry out a more
from ideas to
Implementation
sophisticated version of Hertzs experiment when we tune in to a radio station.
More importantly, we are able to select different radio stations by their
Hertz needed to make an antenna that transmitted
transmitting frequency.
at a specific frequency and a receiver that was tuned to that frequency. This was
not such an easy task when electronics were not available.
Hertz used a number of different transmitters and receivers during the course
of his experiments. Each successive apparatus was refined to give more accurate
results. The basic principle of Hertzs apparatus is shown in Figure 9.1.1.
The transmitter consisted of a pair of metal rods placed end to end with a small
gap between them. He used an induction coil to place charges of opposite signs
on these rods at very large potential difference, causing a spark to jump across the
gap. This caused electrical current to oscillate back and forth across the gap and
along the rods, thus producing an electromagnetic wave. The frequency of the
oscillation was determined by the dimensions of the rods. His first apparatus
produced a frequency of 50 million cycles per second, but he had no way of
The mathematics for calculating the frequency was known
measuring that.
during Hertzs time and so he was able to calculate this oscillation frequency.
The electromagnetic wave emitted by the transmitter was detected by one of
several receivers. The receiver shown in Figure 9.1.1 is a loop of wire with a gap.
The natural oscillating frequency of this loop had to match the frequency
of the transmitter. The dimensions of the loop and gap determined this frequency
and were accurately calculated by Hertz. The receiver showed that a spark jumped
across the gap even if it was placed many metres away from the transmitter, thus
indicating that an electromagnetic wave has been transmitted in air.
plate
rod
spark gap
to induction coil
metal loop
TRY THIS!
Any experiment in a thunderstorm
A
N
A
N
Electromagnetic
radiation: particles
or waves?
PHYSICS FEATURE
The evolution of Hertzs
experiment
Hertzs first apparatus
produced an
electromagnetic wave
with a wavelength of
6 m. There are
difficulties with
precision experiments
when using such a long
wavelength. You need to
be far enough away from
the transmitter so that
you are not simply
detecting a spark in your Figure 9.1.3 Heinrich Hertz was a
receiving antenna due to
skilled experimental
physicist.
purely electrostatic effects
as a result of being too
close to the high-voltage induction coil. Moreover, the
spark in the detecting antenna becomes very weak if
the distance between it and the transmitter is too
great. You have to contend with reflections from the
walls of the room and the objects within it. You must
also face the problem of tuning the transmitter and
receiver to the same frequency when there is no
instrument that can help you. Heinrich Hertz
overcame all of these issues.
Figure 9.1.4 shows the evolution of the different
types of transmitters and receivers used by Hertz. The
first transmitter consisted of zinc spheres at the ends
of the two rods (see Figure 9.1.4a). The antenna could
be tuned by sliding the spheres along the rods. The
receiver was a square loop of wire with a gap. The
second type of antenna, was tuned by changing the
area of the flat plates at the end of the rods (see
Figure 9.1.4b). The receiver was a circular loop with a
gap. The first two antennas produced a wavelength of
6 m. The next generation of transmitter and detector
(Figure 9.1.4c) was a smaller antenna and a similar
receiver (no loop) for a wavelength of 0.66m.
176
transmitters
a
receivers
6m
6m
0.66 m
to induction coil
transmitting
antenna
receiving
antenna
from ideas to
Implementation
distance gave the wavelength . Hertz was able to calculate
the frequency f of the transmitted radio waves. The speed of
these waves c was then determined using the following
well-known wave speed formula:
c = f
Hertz used a different frequency and found that the
speed of the waves remained the same. Although this didnt
prove that the speed of these radio waves was the speed of
light, it was strong supporting evidence of Maxwells theory
1m
2m
3m
4m
5m
6m
7m
8m
Distance from the wall
that the speed of all electromagnetic radiation was the same.
Moreover, the speed of these waves was exactly the measured
Figure 9.1.6 Standing electromagnetic waves produced by reflection
speed of light.
from a large zinc plate enabled Hertz to measure the
Hertz also showed that the path of these waves
speed of light.
could be bent in the same way as light (refraction) by passing
them through a large asphalt prism. (See in2 Physics @
Preliminary section 8.3.)
Hertz showed that the electric and magnetic fields
Heinrich Hertz: Oops
of the radio waves had a unique direction in space, known as
polarisation. The electric field in Figure 9.1.6 points vertically.
ertz initially published his estimate of
the speed of electromagnetic waves as
When the detecting loop was at an anti-node and the direction
2 108 ms1. This relied on the accuracy of his
of the gap aligned with the electric field (vertically), then a
calculation of the oscillation frequency of the
spark jumped across the gap. When the loop was rotated so
transmitter. It was an embarrassing moment when
that the gap was at right angles to the electric field, there was
Poincar, a great French mathematician, wrote him
no spark.
a letter pointing out that he had not included a
It took great skill for Hertz to show that polarised
factor of the square root of 2 in his frequency
electromagnetic waves exist and have a finite speed equivalent
calculation. As a result, his actual measured speed
to the speed of light. Moreover, these waves shared other
was 2.8 108 ms1, which is only 7% off the
properties with light such as reflection and refraction. This set
actual value of 3108ms1. So dont feel so bad
the scene for the emergence of radio communications and
next time you make a numerical mistakeeven
hence the modern field of telecommunicationsmobile phone
great scientists do it.
technology is a sophisticated version of Hertzs experiment.
Hertz did all of this before his life was cut short at the age of
36. One of the greatest honours that can be bestowed upon a scientist
is to name a unit of measurement after them. The international unit of
frequency is no longer called cycles per secondit is known as the hertz.
Checkpoint 9.1
1
2
3
4
5
6
7
8
9
Describe Maxwells contribution to understanding the connection between electricity and magnetism.
Describe an electromagnetic wave.
Recall the two predictions that Maxwell made about electromagnetic waves.
Describe how Hertz produced and controlled the frequency of electromagnetic waves.
Outline the requirements needed of the receiver in order for it to detect any given frequency.
Recall the definition of a standing wave.
Sketch a diagram of a standing wave in a rope, labelling a node and anti-node.
Outline how Hertz manipulated the equipment in order to determine the wavelength of the electromagnetic wave
produced.
Define polarisation and explain how Hertz showed that electromagnetic waves were polarised.
177
Electromagnetic
radiation: particles
or waves?
PRACTICAL
EXPERIENCES
Activity 9.2
TRY THIS!
Sun power
Use a magnifying glass to
focus the Suns rays to a point
on a wad of tissue paperit
should catch fire. Now colour
another tissue paper with black
ink, say from a felt-tip pen or
by dipping it in black ink.
Let it dry. Focus the Suns rays
onto this blackened tissue.
You should find that the paper
catches fire much more
quickly. This is because black
absorbs more of the incoming
radiation and therefore heats
up more quickly.
ultraviolet visible
infra-red
10
8
Intensity I (arb.units)
6
max
T = 6000 K
4
5000 K
max
4000 K
2
3000 K
1.0
Wavelength (m)
2.0
Figure 9.2.2 The intensity of light from a hot object is dependent on wavelength.
178
3.0
from ideas to
Implementation
These assumptions were radical ones since classical physics said that objects
can have any energy. Max Plancks theory defied this assumption. Incredibly,
Planck himself resisted this theory wholeheartedly for many years and thought
of it as nothing more than a mathematical convenience. However, this idea of the
quantised nature of energy heralded the start of quantum physics, which, together
with relativity, became the foundations of the modern physics we use today.
classical theory
Radiance
the classical theory approach. They assumed that the walls of the cavity were
made from tiny oscillators that emit electromagnetic wavesjust as Hertz had
Although this theory was able to
assumed for his transmitting antenna.
reproduce the shape of the graph for large wavelengths (see Figure 9.2.4), it failed
badly at the shorter wavelengths. Here the calculated intensity increased towards
infinity, which violated the law of conservation of energy. This was called the
ultraviolet catastrophe, because this started to occur at the ultraviolet end of the
radiation spectrum.
Max Planck (18581947) solved this problem in 1900 and was able to
mathematically reproduce the black body radiation curve by making the following
radical assumptions:
The emitters in the walls of the cavity can only have energies E given by
E = nhf
where f is the emitted frequency in hertz, h is a constant (now called the
Planck constant = 6.63 1034Js) and n is an integer. The energy E is
measured in joules.
The emitters can absorb or radiate energy in jumps, or quanta. Two
consecutive energy states of an emitter differ by hf.
experiment and
Planck theory
2
3
Wavelength (m)
c = f
a
y
E
B
z
photon with
energy hf
c
x
z
x
c
179
Electromagnetic
radiation: particles
or waves?
PHYSICS FEATURE
Max avoids a catastrophe
Figure 9.2.6
Max Plancks radical
hypothesis avoided the
ultraviolet catastrophe
and started a new field
of physics called
quantum physics.
from ideas to
Implementation
Einstein assumed that the energy E of the photon is related to its frequency
f by the equation:
E = hf
where h is Plancks constant = 6.63 1034Js. In classical theory, the intensity
was determined by the electric field of the electromagnetic wave. In the new
quantum theory, it is the number of photons that determines the intensity.
He then used this idea to explain peculiar properties of metals when they are
irradiated with visible and ultraviolet light, known as the photoelectric effect.
Worked example
question
A laser pointer emits light with a wavelength of 6.50 107 m. The power of the laser is
1.00 103 W.
Assume that Plancks constant h = 6.63 1034 Js, and the speed of light
c = 3.00 108ms1.
a Calculate the energy in each photon of the laser beam.
b Calculate the number of photons emitted each second.
Solution
a The energy E in a photon is given by
E = hf
where h is Plancks constant and f is the frequency of the light. We can calculate
the frequency from the following:
c = f
Therefore:
E =h
c
3.00 108
= 6.63 1034
= 3.06 10 17 J
6.50 109
b The laser has a power of 1.00 103 W. This means it emits 1.00 103 joules per
second. The number of photons per second is given by:
number of photons per second =
power
1.00 103
=
energy per photon 3.06 1017
Checkpoint 9.2
1
2
3
4
5
6
Describe what happens to the wavelength of light emitted as the temperature of an object is increased.
Define a black body.
Outline how classical theory described black body radiation.
Explain the ultraviolet catastrophe.
Outline the assumptions Planck used to solve the problem.
Explain Einsteins revolutionary thoughts on light.
181
Electromagnetic
radiation: particles
or waves?
light
electrons
gst
en
pla
tin
um
tun
zin
pot
ass
sod ium
ium
f0
f
W
0.5
1.0
1.5
In his experiments on radio waves, Hertz noticed that the sparks were easier
to produce in the gap of the detector loop whenever it was directly exposed to the
ultraviolet light from the spark of the transmitting antenna. Placing an ordinary
glass plate between the transmitter and the detector reduced the brightness of the
spark, because ordinary glass blocks ultraviolet light. Replacing the glass plate with
a quartz glass plate caused the spark to be more easily produced, because quartz
glass allows ultraviolet light to be transmitted. He concluded that ultraviolet light
has an effect on the receiving loop, but he was not in a position to explain it.
However, he realised that it was an extremely important phenomenon and stopped
his wave research in order to study it. Hertz had discovered the photoelectric effect.
The photoelectric effect is the ejection of electrons from the surface of a
polished metal when light is shone on it. The metal on either side of the gap
of Hertzs detector loop emitted electrons when struck by ultraviolet light, which
assisted in producing the spark. Ironically, the results of this experiment were used
by Einstein to prove the particle nature of light, in contrast to Hertz who wanted
to prove its wave nature.
The photoelectric effect was studied by other physicists who used a cathode ray
tube, as shown in Figure 9.3.1. The tube either had a quartz window or was made
of quartz, which allowed ultraviolet light to pass through and strike the cathode,
known as a photocathode. This caused electrons, known as photoelectrons,
to leave the cathode and be collected by the anode, thus registering a current on
the ammeter.
The photoelectric effect was considered to be strange because:
No electrons are ejected from the metal below a certain frequency no matter
how intense the light. This is called the cut-off frequency f0. This is in
contrast to classical theory, which says that all electromagnetic waves have
energy and, if you wait long enough, all electrons can gain energy and leave
the surface.
The kinetic energy of the electrons increases as the frequency of the incident
light increases (that is, going from red to blue to ultraviolet and beyond) as
shown in Figure 9.3.2. However, there is no change in the electron energy if
the frequency of the incident light is constant but the intensity is increased.
Increasing the intensity simply increases the number of electrons but their
energy remains the same. From classical wave theory, an increase in intensity
should result in an increase in the energy of the electrons.
There is no delay between the time the light is shone on the surface and the
time the electrons are emitted, no matter how dim the light source. From
classical theory, electrons would require a length of time to gain enough energy
from a low-intensity light source in order to escape the surface of the metal.
Einstein explained the photoelectric effect by assuming that an electron is
ejected when it absorbs a photon that has energy E = hf. Although electrons
involved in electrical conduction in a metal are free to move around, they are still
bound to the metal as a whole. Energy is needed to separate them from the metal.
This energy is called the work function W. The value of W depends on the type
of metal. The photoelectric effect is mainly a surface phenomenon, so the surface
must be free of oxide films, grease or other surface contaminants. Einstein then
simply conserved energy by stating that:
from ideas to
Implementation
nte
ractiv
M o d u le
PRACTICAL
EXPERIENCES
Activity 9.3
Current
Worked example
question
A zinc photocathode is irradiated with ultraviolet light. A voltage of 2.0 V is required to stop
the photoelectrons.
a Calculate the maximum kinetic energy of the photoelectrons. Use the electron charge
e = 1.6 1019 C.
b Calculate the wavelength of the ultraviolet light striking the photocathode, given the
work function of zinc W = 6.88 1019 J and Plancks constant h = 6.63 1034 J s.
Solution
a The maximum photocathode kinetic energy Kmax is given by:
Kmax = eVstop = 1.6 1019 2.0 = 3.2 1019 J
b Use Einsteins photoelectric effect formula: Kmax = hf W
where Kmax is obtained from part a, h is Plancks constant and the zinc work function
W = 6.88 1019 J. In order to calculate the wavelength we first require the frequency,
which is obtained by manipulating Einsteins formula:
f=
Electromagnetic
radiation: particles
or waves?
Checkpoint 9.3
1
2
3
4
Classical prediction
5
6
Explain how the work function helps explain the measured energy of photoelectron.
Outline the significance of the stopping voltage.
anode
dynode
incoming
light photon
photo cathode
electron
184
9.4 Applications of
the photoelectric effect
The photoelectric effect presents a convenient way of producing an electrical
signal from light. The apparatus used to produce the traditional photoelectric
effect (a vacuum tube with a photocathode and anode), called a photocell
(photoelectric cell), was not of practical use outside the classroom for
demonstrating the photoelectric effect. The current that was produced was far
too small for most applications.
A modification to this tube was made by placing a series of specially coated
surfaces, called dynodes, between the photocathode and the anode (see Figure
9.4.1). This is known as a photomultiplier tube. The dynodes are at successively
increasing positive voltage with respect to the cathode, so they attract the
electrons emitted by the photocathode. Each incident electron causes many other
electrons to be emitted from the dynode surface. This effect is multiplied at the
next dynode, and so on until a large number of electrons reach the anode and
register a relatively large current. This makes the photomultiplier tube extremely
sensitive to low levels of light, such that it is possible to detect individual
photons. The material from which the photocathode is made has been improved
and it can eject electrons from incident light ranging from the ultraviolet to the
infra-red range of the electromagnetic spectrum.
This high sensitivity of the photomultiplier tube makes it suitable for the
detection of small changes in light intensity. This has opened up many
applications in astronomy, nuclear physics, blood testing and medical imaging.
One method of medical imaging is called positron emission tomography
(PET). One use of this method is the imaging of tumour cells within the body.
The patient is injected with a short-lived radioactive chemical that contains
molecules that attach themselves to particular tissue types such as tumour cells.
This chemical emits two very high energy photons (known as gamma photons)
that exit the body in opposite directions and strike a ring of photomultipliers
around the patient (see Figure 9.4.2). Each photomultiplier has a material in
from ideas to
Implementation
front of it, known as a scintillator, that gives out flashes of light when the gamma
photons strike it. The flashes of light are converted to electrical signals by the
photomultiplier tubes. Computers calculate the time of arrival of the individual
gamma photons and are able to locate their point of origin in the body. In this
way, an image of the tumour cells can be generated from the different pairs of
gamma photons arriving from different locations.
Another use of the photoelectric effect is in night-vision devices used by
the military to see in almost total darkness by using the few photons coming
from stars.
If we broaden our definition of the photoelectric effect to mean any process
by which light is converted to electricity, without the need to eject an electron
from a surface, then we can include many devices made from semiconductor
materials. Although the electron stays within the solid, it becomes free from the
atom to which it was bound, when struck by a photon. However, one important
use of this type of photoelectric effect is the conversion of sunlight into electrical
energy in solar cells. More detail will be given on this topic in Chapter 10
Semiconductors and the electronic revolution.
detectors
positron-electron
collision
gamma rays
created
Checkpoint 9.4
1
2
3
PRACTICAL EXPERIENCES
from ideas to
Implementation
CHAPTER 9
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Perform an investigation to
demonstrate the production
and reception of radio waves.
Discussion questions
1 Explain how your experiment differs from the one carried out by Hertz.
2 Describe what you notice as you move further away from the coil.
185
Electromagnetic
radiation: particles
or waves?
PRACTICAL EXPERIENCES
Activity 9.2: Black body radiation
Part A: Gather information about how the photoelectric effect is used in photocells.
Part B: Set up an electrical circuit to investigate the energy that a photon of light
gives to an electron. You can carry out the experiment first hand or use a computer
simulation.
Discussion questions
1 List devices that can be classified as photocells and explain how their
working principle is related to the photoelectric effect.
2 Use the graph of electron energy (y-axis) and photon frequency (x-axis)
to calculate the Planck constant.
Gather information about the differing views held by Planck and Einstein and use
it to write an informed paragraph on your views.
Discussion questions
1 Outline Einsteins and Plancks view of scientific research.
2 List political pressures on Planck and Einstein during the times of their
main discoveries, and assess the influence of these pressures on Plancks
and Einsteins research.
Chapter summary
186
from ideas to
Implementation
Review questions
Physically speaking
Reviewing
1 State the significance of Maxwells equations.
2 Outline the purpose of Hertzs experiments.
3 Outline the procedure followed by Hertz to determine
the speed of radio waves.
Solving problems
19 Calculate the energy of a photon of red light with
a wavelength of 656 nm.
0.75
1.0
f (1015 Hz)
iew
Q uesti o
0.25 0.50
Re
187
10
Semiconductors
and the electronic
revolution
Electronics transforms the world
from ideas to
Implementation
Energy
Energy
Energy
conduction band
energy gap
valence band
189
10
Semiconductors
and the electronic
revolution
a
c
conduction band
overlap
energy gap
valence band
Figure 10.1.4 Energy band diagrams for (a) a conductor, (b) a semiconductor and (c) an insulator
The type of atom, the bonding between atoms and the number of valence
and inner electrons of an atom are some factors that determine the type of
energy band diagram for the resulting solid. This is the area of physics known as
solid state physics, which relies heavily on quantum physics. Figure 10.1.4a
shows an energy band diagram for conductors. There is no clear energy gap and
therefore no distinct conduction or valence band. We can say that the
conduction band is partially filled and therefore the solid contains free electrons.
The energy band diagrams for a semiconductor and an insulator are similar
The much smaller energy gap of the semiconductor
(Figure 10.1.4b, c).
distinguishes it from an insulator. The valence band of an insulator is full and,
due to the relatively large size of its energy gap, none of the electrons in the
valence band have enough energy to reach the conduction band. However, some
electrons in the valence band of the semiconductor can acquire enough thermal
The
energy to jump across the small gap into the conduction band.
number of conduction electrons that result is much less than for a conductor, so
a semiconductor is a much poorer conductor than a metal but a better conductor
than an insulator such as plastic or glass.
Checkpoint 10.1
1
2
3
4
5
Describe how gases, which usually have insulating properties, can be made to conduct electricity.
Describe why the energy levels of two neighbouring atoms are not exactly the same.
State why only electrons are charge carriers in solids.
Define the terms electrical conductor and insulator.
Describe how it is possible for electrons in the valence band of semiconductors to reach the conduction band.
10.2 Semiconductors
The size of the energy gap determines whether a solid is a semiconductor
or an insulator. A convenient unit for measuring the energy gap is the electron
volt (eV). This is a unit of energy and is related to the joule:
1 eV = 1.6 1019 J
Most semiconductors have an energy gap less than 5 eV. Insulators have
greater energy gaps. Examples of energy gaps for semiconductors and insulators
are given in Table 10.2.1. The most common semiconductors are silicon (Si) and
germanium (Ge).
190
from ideas to
Implementation
Some electrons in the valence band of a semiconductor will gain
enough energy from the ambient thermal (heat) energy in the solid to
jump up to the conduction band. This occurs readily at normal room
temperature. Heating a semiconductor further increases the number of
electrons jumping from the valence band to the conduction band, thereby
making the solid more conducting. For example, heating a piece of silicon
will lower its resistance. At absolute zero (T = 0K), there is no thermal
energy available for the electrons, all of which remain in the valence band;
therefore, at absolute zero the semiconductor behaves as an insulator.
Semiconductor crystal
TRY THIS!
Si
Ge
InP
GaP
GaAs
CdS
CdTe
ZnO
ZnS
Diamond
1.14
0.67
1.35
2.26
1.43
2.42
1.45
3.2
3.6
5.4
Semiconducting pencil
Graphite is a form of carbon that is used in lead pencils (there is
no lead in pencils, just graphite and clay, but we will refer to it as
the lead of the pencil). Graphite also behaves as a semiconductor.
Obtain a length of pencil lead (about 10 cm) and attach each end
to the probes of a digital ohm meter. It should measure units of
ohms. Now heat it (with, say, a match) and observe that the
resistance decreases. The resistance may decrease by up to 0.5
and it will be more for thinner leads.
This decrease in resistance is due to an increase in the number
of electrons jumping from the valence band to the conduction band.
The resistance will increase once more when the lead cools.
conduction
band
narrow
energy gap
v
Doping
At normal room temperature (300 K) the average kinetic energy of an electron is
about 0.026eV. However, the energy gap of silicon is 1.14eV. It would appear
that the electrons dont have enough energy to jump to the conduction band, so
why is silicon, or any of the other semiconductors listed in Table 10.2.1, slightly
conducting at room temperature? The answer is that 0.026 eV is an average
energy. This means there are some electrons that have a much lower energy and
valence
band
holes
10
Semiconductors
and the electronic
revolution
A hole in
a bottle
a
+4
+4
+4
+4
+4
+4
+4
+4
+4
+4
+4
+4
+4
+5
+4
+4
+4
+4
+4
+4
+4
+4
+3
+4
+4
+4
+4
electron
192
hole
some that have a much higher energy. It is these higher energy electrons that
make it across the gap, but there are not many of them.
Physicists have found that the conductivity of semiconductors can be
improved by introducing impurities into the crystal lattice, a process that is
called doping. In order to understand how an impurity atom can improve the
conductivity, we must examine what happens at the crystal lattice level.
We will use silicon as an example, but similar explanations can be applied to
all other semiconductors. Silicon belongs to group 4 of the periodic table, which
means that the silicon atom has 4 electrons available for bonding with other
silicon atoms (see Figure 10.2.2a). Silicon can be doped with phosphorus P by
replacing a silicon atom with a phosphorus atom. Phosphorus is in group 5 of the
periodic table and has 5 electrons available for bonding, but it can only bond
with 4 atoms in the silicon crystal lattice. This leaves one electron unbonded (see
This semiconductor is called an n-type semiconductor.
Figure 10.2.2b).
(n represents the negative charge of the unbonded electron.) Impurities that
produce unbonded electrons are called donor impurities. These unbonded
electrons are weakly attached to their atoms and can easily be moved into
the conduction band by ambient heat, and thus can contribute to an electrical
current.
Similarly, silicon can be doped with boron, which belongs to group 3 and
therefore has 3 electrons available for bonding with 3 atoms of silicon. There is
no electron available for the fourth silicon atom to bond with and, as a result,
a vacancy in charge has been created, which we call a hole. An electron from a
neighbouring silicon atom can now jump across to fill this vacancy (see Figure
10.2.2c). This, in turn, creates a vacancy for the atom it left behind, and so can
be viewed as the movement of the hole. As already mentioned, a hole behaves
like a positive charge since it moves in the opposite direction to the electron. The
semiconductor behaves as though it has positive charge carriers and is therefore
called a p-type semiconductor. Impurities that produce a charge vacancy are
called acceptor impurities.
Note that both n- and p-type semiconductors remain electrically
neutral even though there is an unbonded electron or a vacancy is created. The
neutral impurity atoms were added to an already neutral solid, so the net charge
remains zero.
Both n- and p-type doping improve the conductivity of a semiconductor by
creating an excess of negative and positive charge carriers respectively. The
unbonded electron in n-type semiconductors needs a small amount of energy to
move to a different location. Similarly, very little energy is required to cause a
neighbouring electron to occupy the vacancy of a p-type semiconductor.
The energy-band diagram of an n-type semiconductor is shown in Figure
10.2.3. A new energy level has been created in the energy gap just below the
conduction band. This level is occupied by the unbonded electrons of the
impurities and is called the donor energy level. This level is very close to the
conduction band. The difference in energy between the two levels is usually
0.026 eV or less, and so most of the electrons in the donor level are able to jump
up to the conduction band at room temperature, making the semiconductor
much more conducting than the undoped crystal.
Figure 10.2.2 The silicon lattice (a) has no doping, (b) has been doped with phosphorus
and (c) has been doped with boron.
from ideas to
Implementation
The energy-band diagram for a p-type semiconductor in Figure 10.2.4 shows
that there is an extra energy level, called an acceptor energy level, near the
valence band. This level is full of vacancies (holes), allowing the valence electrons
to jump up and occupy these vacancies and therefore contribute to an electrical
current. The gap between the acceptor level and the donor (valence) level must
be equal to or less than 0.026 eV for conduction to take place at room
temperature.
A semiconductor is labelled as extrinsic if conduction is dominated by donor
or acceptor impurities. Otherwise, it is known as an intrinsic semiconductor.
conduction band
conduction band
holes
acceptor impurity
energy level
valence band
valence band
n-type semiconductor
a p-type semiconductor
PRACTICAL
EXPERIENCES
Activity 10.1
Checkpoint 10.2
1
2
3
4
5
6
Distinguish between a semiconductor and an insulator in terms of the size of the energy gap.
Describe what is meant by a hole.
Describe doping in semiconductors.
Explain the difference between n-type and p-type semiconductors.
Identify elements that can be used to make n-type and p-type semiconductors.
Distinguish between acceptor and donor energy levels.
The pn junction
A pn junction is a p-type material joined onto an n-type material. In reality,
the pn junction is made from a continuous single crystal in which the
concentration of impurities has been made to change abruptly from p-type to
n-type as we cross the junction. However, to clarify the physics of the junction
region, we will assume that the p-type and n-type crystals are initially separated
and then brought together.
The pn junction is used as the most basic electronic device, the diode.
It allows current to move in only one direction, as shown in Figure 10.3.1.
193
10
Semiconductors
and the electronic
revolution
V
forward
bias
A
+
b
p
forward
bias
O
reverse bias
electric
field
depletion
layer
p-type
n-type
E
E
free
electrons
free
holes
fixed
negative
ions
fixed
positive
ions
194
The conventional current easily flows from p- to the n-type material when the
positive terminal of a power supply is connected to the p-type end of the device.
Reversing the connections of the power supply (positive connected to n-type and
negative to p-type) results in a very tiny reverse current, the current is essentially
stopped. The symbol for the diode is an arrow that indicates the direction of the
conventional current when the positive and negative terminals of the power supply
are connected to the p- and n-type materials respectively (see Figure 10.3.1b).
A key to understanding the physics of a pn junction is to first examine
the concept of diffusion. Free particles are in continuous random motion and
move from areas of high to areas of low concentration. For example, a drop of ink
placed in a beaker of water results in the ink particles slowly spreading outwards so
that the water becomes uniform in colour. This occurs because the point of origin
of the free particles is very small (drop of ink) compared with the rest of the
medium (beaker of water). So it is more likely that the particles will move to the
rest of the medium. This is diffusion and it is a property of all randomly moving
particles. The energy for the movement of these particles (kinetic energy) comes
from the ambient thermal energy. At a temperature of absolute zero there will not
be any diffusion.
Similarly, the holes in the p-type material and the electrons in the n-type
material are in constant random motion due to their thermal energies, resulting in
a uniform density of the charge carriers across their respective materials. When the
two crystals are brought into contact, electrons from the n-type crystal will
naturally want to diffuse into the p-type crystal, because the electron density of its
conduction band is lower. Conversely, the holes in the p-type region want to
This results in an
diffuse into the n-type crystal, as shown in Figure 10.3.2.
electrical current, which is quickly stopped because an electric field builds up from
the n to the p region, due to this charge separation, and stops the further flow of
charge. A large part of this diffusion region results in the combination of electrons
with holes, leaving it depleted of charge. This is known as the depletion region.
However, there continues to be an excess of electrons and holes on the p and n
sides respectively.
A depletion region is like two parallel plates with equal and opposite
charges on them, resulting in an electric field in the space between them, which
contains no charge.
The energy that created the electric field essentially came from the
thermal energy of the charge carriers. At a temperature of absolute zero (0 K),
there will not be enough energy for diffusion to occur and therefore no electric
field and no depletion region will be created.
Electrons that diffused across and combined with holes cannot easily
drift back since they lost energy as they fell into the holes. Similarly, holes that
diffused combine with electrons from the n-type material. The trapping of the
diffused holes and electrons has resulted in the conversion of thermal energy into
the electrostatic energy stored in the electric field region.
As shown in Figure 10.3.3a, connecting the positive and negative
terminals of a battery to the p and n sides of the junction respectively creates an
electric field opposing that in the depletion region. This lowers the junction
electric field strength and therefore allows further charge to diffuse across the
junction. A thin pn junction will enable the excess diffused charge to reach the
from ideas to
Implementation
metal terminals on either side of the junction, thus resulting in the flow of
current in the outside circuit. This type of connection to the pn junction is
known as forward bias. This also has the effect of reducing the width of the
depletion region.
Conversely, connecting the negative and positive terminals of a battery
to the p and n sides respectively, as shown in Figure 10.3.3b, increases the
electric field strength in the junction region. This will cause some of the diffused
electrons to travel back to the n region, setting up a small reverse current. This
type of connection is known as reverse bias. It also has the effect of increasing
the width of the depletion region.
Rectifiers
input
output
C
hole
flow
p-type
electron
flow
n-type
E
electron flow +
temporary
hole flow
electron flow
temporary
wide
depletion layer electron flow
p-type
n-type
E
+
voltage smoothed
narrow
depletion layer
where Plancks constant h = 6.63 1034 Js, f is the frequency, c is the speed
of light and is the wavelength. Manufacturers of LEDs can control the
wavelength of the emitted light by controlling the size of the energy gap. Most
remote-control units around the home have an LED on the end that you point
towards a device. The LED looks like a very tiny plastic light blub, but works
very differently. The wavelength of these LEDs is usually in the infra-red range
of the spectrum and so is not visible to the eye. However, it is visible to most
types of digital cameras such as those in mobile phones and video cameras.
PRACTICAL
EXPERIENCES
Activity 10.2
light emission
conduction
band
E = h
valence
band
hole flow
hole and
electron
recombine
p-type
electron flow
n-type
electron
flow
E = h
electron
flow
195
10
Semiconductors
and the electronic
revolution
PRACTICAL
EXPERIENCES
Activity 10.3
PHYSICS FEATURE
n-type layer
(semiconductor)
light absorption
conduction
band
absorbed
photon
substrate
junction
p-type layer
(semiconductor)
valence
band
new hole and
electron created
p-type
hole flow
electron
flow
current
n-type
electron flow
load
resistor
electron
flow
(current)
electron
flow
back
contact
freed electrons
from ideas to
Implementation
Try this!
seeing infra-red
Grab a remote control that you use for the TV set or DVD player.
Look at the LED at the end of the remote control while pressing
any button (for example, the volume or channel button). You should
not see anything happening. Now point the remote control at your
mobile phone camera or any other digital camera. While looking at
the screen of the camera, press any button on the remote control.
You should now see the LED flashing, because the camera is
sensitive to the infra-red light from the LED but your eye is not.
Figure 10.3.8 The infra-red light from the LED of a remote control has been
made visible by the digital camera that took this photograph.
Checkpoint 10.3
1
2
3
4
5
6
7
10
Semiconductors
and the electronic
revolution
electrons
anode
electrons
cathode
heater
Thermionic devices
electrons
A
anode
electrons
grid
cathode
heater
heater
(filament)
cathode
grid
anode
+150 V
The transistor
output
0V
198
from ideas to
Implementation
The most common transistor is the bipolar (or junction) transistor. A schematic
diagram of a bipolar transistor is shown in Figure 10.4.5. This transistor works in
a similar way to the original transistor of Bardeen and Brattain, but its structure is
very different.
The bipolar transistor (Figure 10.4.5) consists of a p-type semiconductor
sandwiched between two n-type semiconductors. A lead (or wire) is connected
to each layer of the material. The two outer layers are called the emitter and the
collector; the central layer is called the base. This is often called an npn
transistor. The width of the base has been exaggerated for clarity, but is usually
very thin.
An electron current will only flow from the emitter to the collector if there is
a potential on the base with respect to the emitter (see Figure 10.4.6). Recall that
in a pn junction there is an electric field that points from n to p. Applying a
positive voltage on the base of an npn transistor with respect to the emitter
reduces the size of this field to almost zero. This then allows electrons from the
emitter to move into the base region. These electrons are then accelerated into the
collector region by the electric field at the junction between base and collector.
The electrons that moved into the collector can then flow into an external circuit.
Note that no current will flow without applying the potential to the
base. The current to the base is much smaller than the current that it allows to
flow from emitter to collector. Thus the transistor, like the triode, is a current
amplifier. The circuit given in Figure 10.4.7 has a small AC voltage applied to the
base of an npn transistor, which causes a much larger current to flow into an
external resistor between the emitter and the collector. The voltage across the
resistor is larger than the input signal but varies in the same way. This circuit is a
basic amplifier. Bipolar transistors also can be made in the pnp configuration.
electrons
electrons
n
collector
p
base
collector
electrons
emitter
small
current
emitter
base
collector
emitter
base
+
Vbase
large
current
+
Vcollector
load
resistor
amplified
output
input
+
+
Vbase
Vcollector
199
10
Semiconductors
and the electronic
revolution
PHYSICS FEATURE
Integrated circuits
drain (+)
insulating
layer
n
few electrons
can pass
substrate (p)
silicon
dioxide
film
n-type
p-type
substrate
Checkpoint 10.4
1
2
3
4
5
6
200
PRACTICAL EXPERIENCES
from ideas to
Implementation
CHAPTER 10
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Perform an investigation to
model the behaviour of
semiconductors, including the
creation of a hole or positive
charge on the atom that has lost
the electron and the movement
of electrons and holes in
opposite directions when an
electric field is applied across
the semiconductor.
Discussion questions
1 Determine the angle (with respect to the direction of the Sun) of the solar
cell that delivers the greatest amount of power to the motor, or the greatest
voltage.
2 Determine the effect on power output of different wavelengths of light by
placing, for example, coloured cellophane in front of the solar cell.
201
10
Chapter summary
Semiconductors
and the electronic
revolution
Review questions
Physically speaking
Reviewing
p-type
n-type
Semiconductor
Energy bands
Depletion zone
Conductor
Valence band
Forbidden gap
Insulator
Thermionic devices
Solid state
Triode
Diode
Transistor
Solar cell
Holes
Electrons
Doping
7
8
202
from ideas to
Implementation
10 Explain how a triode can be used to control the
amount of current flowing in a circuit.
Solving problems
14 A 1 milliwatt (103 W) red laser pointer outputs
Vo
iew
Q uesti o
Figure 10.5.1
Re
203
11
Superconductivity
Surprising discovery
Figure 11.1.1 Crystal structure of sodium chloride. The red spheres represent positive
sodium ions, and the green spheres represent negative chlorine ions.
204
from ideas to
Implementation
of interference of electromagnetic radiation, and examine how this was applied to
crystals using X-rays. Then we will see how the BCS theory of superconductivity
made use of the crystal structure of matter.
Try This!
Checkpoint 11.1
t=0s
t=4s
Figure 11.2.1 Two identical waves (red, green) travelling in opposite directions can add (blue)
t=1s
The interference of identical waves from two sources can also be represented
by outwardly radiating transverse waves (see Figure 11.2.3). The distance that a
twave
= 3 s travels is known as its path length. t = 6 s Constructive interference occurs
when the difference in the path length of the two waves is equal to 0, , 2, 3,
4 or any other integer multiple of the wavelength . Destructive interference
occurs when the two waves are half a wavelength out of step. This corresponds to
t=4s
t=7s
a path length difference of /2, 3/2, 5/2 etc.
t=5s
waves
in phase
lines of destructive
interference
lines of constructive
interference
constructive
interference
t=6s
destructive
interference
constructive
interference
t=7s
11
Superconductivity
Light also has wave properties and produces interference effects when it is
passed through two narrow and closely spaced slits (see Figure 11.2.4). Each slit
acts as a source of light waves that are in step with each other. The resulting
constructive and destructive interference pattern appears as bright and dark
bands on a screen.
The position of the bright bands can be determined by applying the
condition that the path length difference between the two waves must be an
integer multiple of wavelengths (see Figure 11.2.5). This can be expressed by
the following equation:
screen
double
slit
m = dsin
where m = 1, 2, 3 , d is the spacing between the slits and is the angular
position of the bright bands. The central bright band corresponds to m = 0.
The first band on either side of it corresponds to m = 1, and so on.
Interference also occurs when many slits are
constructive
used (see Figure 11.2.6). The bright bands
interference
become narrower and sharper on the edges, with
y
increasing number of slits. In practice, the slits
S1
L
S2
slit 1
d
slit 2
d sin
waves
in phase
waves
in phase
TRY THIS!
Diffraction grating in my stereo
A commonly available diffraction grating is the humble CD
or DVD. It has thousands of tiny and closely spaced pits
that cause ordinary white light to break up into the colours
of the rainbow because there is an angle for constructive
interference for each wavelength contained in white light.
A more clearly defined diffraction pattern can be made by
shining a low power (less than 1 mW) laser pointer on the
CD or DVD so that the beam is reflected to a nearby wall.
You should notice that there are fainter dots near the
reflected dot. These fainter dots arise from constructive
interference of the laser light.
waves
in phase
destructive
interference
constructive
interference
from ideas to
Implementation
position can be accurately determined. The angular positions of the bright bands
Knowledge
are determined by the formula given above for the double slit.
of the wavelength can be used to determine the slit separation and vice versa.
To obtain an interference pattern, the distance between slits must be
close to the wavelength of the light falling on them. The interference pattern
from a diffraction grating is also called a diffraction pattern.
Checkpoint 11.2
1
2
3
anode
vacuum
X-rays
heavy
metal
target
metal rod
(removes
heat and
electrons)
cooling fins
11
Superconductivity
Checkpoint 11.3
X-rays
1
2
crystal
3
lead
collimator
photographic
film
incident
angle
reflection
angle
d sin
208
from ideas to
Implementation
From measurements of the diffraction angle , the Braggs were able to determine
the wavelength of X-rays. This needed an accurate knowledge of the spacing d
between planes in the crystal. However, this spacing was not known accurately,
since there was no method available to measure it. How can you measure
without knowing d and vice versa?
The Braggs solved this dilemma by using the diffraction pattern to determine
the arrangement of the atoms, without the need to know the space between them.
For example, it is possible to know that a crystal has a cubic arrangement from a
Laue diffraction pattern, without the need to know the spacing of atomic planes
Knowledge of the lattice geometry, the density
or the wavelength of X-rays.
of the crystal (mass/volume) and the mass (in grams) of the individual atoms
enables the spacing between atoms to be calculated accurately. This calculated
spacing d can now be used with the X-ray diffraction angle , to determine the
wavelength from the Bragg diffraction equation. Today, we simply use the
database of known crystal lattice spacings to determine X-ray wavelengths.
Checkpoint 11.4
1
2
3
11
Superconductivity
Anything that disturbs the regularity of the lattice results in electron collisions
and contributes to the electrical resistance of the crystal.
In practice, metals do show electrical resistance, as evidenced by the increase
in temperature of the metal wire in an electric heater or the heating element
on a stove. Resistance in metals originates from collisions of electrons with
irregularities in the crystal lattice. These can be caused by lattice vibrations, or
impurities (a foreign atom substituted for one in the crystal lattice) and defects
of the lattice (such as a missing atom) (see Figure 11.5.2). The crystal lattice of
all metals above a temperature of 0K consists of waves of lattice vibrations
known as phonons. A phonon colliding with an electron causes it to lose energy
and thus contributes to electrical resistance.
Real metal wires consist of many small crystals joined together and separated
by irregular boundaries. The boundaries also serve as places where electron
collisions take place and thus contribute to electrical resistance.
a
Figure 11.5.2 Electrical resistance is caused by electron collisions due to crystal lattice
(a) impurities, (b) defects and (c) vibrations.
he kelvin temperature scale differs from the Celsius scale by the following
expression:
kelvin = Celsius + 273.15
Checkpoint 11.5
1
2
3
4
210
from ideas to
Implementation
Hg
0.10
0.05
0.00
4.1
4.2
4.3
Temperature (K)
4.4
Table 11.6.1 Some superconductors, their critical temperatures and critical magnetic fields
Material
Critical temperature
(kelvin)
Type I
Tungsten
0.02
0.0001
(elements)
Titanium
0.4
0.0056
Aluminium
1.18
0.0105
Tin
3.72
0.0305
Mercury ()
4.15
0.0411
Lead
7.19
0.0803
Type II
NbTi alloy
(compounds
and alloys)
NbZr alloy
10.8
11
PbMo6S8
14.0
45
V3Ga
16.5
22
10.2
12
Nb2Sn
18.3
22
Nb3Al
18.9
32
Nb3Ge
23.0
30
Type II
YBa2Cu3O7
92
(high-temperature
ceramic
compounds)
Bi2Sr2Ca2Cu3O10
110
Tl2Ba2Ca2Cu3O10
125
HgBa2Ca2Cu3O8
135
Electrical resistance
Class
superconductor
normal
metal
TC
Temperature (K)
11
Superconductivity
Checkpoint 11.6
1
2
3
Ideal
conductor
Superconductor
Checkpoint 11.7
Explain what happens to a magnetic field passing through an ideal conductor and a superconductor when the conductors
are cooled to below their critical temperatures.
Checkpoint 11.8
1
2
Bc
External field Ba
Internal field Bi
Internal field Bi
from ideas to
Implementation
Bc1
Bc2
External field Ba
11
Superconductivity
induced
shielding
current
permanent
magnet
high-temperature
superconductor
image of
permanent
magnet
this, we need to expose a little secret used when demonstrating this levitation
experiment. If the magnet is lightly placed over a newly cooled high-temperature
superconductor, you should find that the magnet does not stay levitated for long
and falls off very quickly. To get the magnet to stay, hold the magnet over the
superconductor and, rather than letting it go, thrust it slightly towards the
superconductor. This is a subtle movement and usually goes unnoticed by the
audience. Now release the magnet and it will remain there stably. Incredibly, if the
magnet is then removed and dropped back over the superconductor, it levitates
stably without the need to thrust the magnet towards the superconductors. It is as
if the superconductor has remembered that the magnet was there. Moving the
magnet back and forth parallel to the surface of the superconductor or allowing
the superconductor to warm up above Tc and then cooling it down again will
make the levitation of the magnet unstable once more. The magnet must again be
thrust towards the superconductor to achieve stability. This is explained in the
following section.
Ba
600
from ideas to
Implementation
Checkpoint 11.9
1
2
3
4
a
e
b
+ + e
+ +
c
+ +
+ +
215
11
Superconductivity
PHYSICS FEATURE
High-temperature superconductors: The exceptions to the rule
216
from ideas to
Implementation
goes on to form Cooper pairs with other electrons. The end result is that each
electron in the solid is attracted to every other electron, forming a large network
of interactions. Causing just one of these electrons to collide and scatter from
atoms in the lattice means the whole network of electrons must be made to
collide into the lattice, which is energetically too costly. The collective behaviour
of all the electrons in the solid prevents any further collisions with the lattice.
In this case, the
Nature prefers situations that spend a minimum of energy.
minimum energy situation is to have no collisions with the lattice. A small
amount of energy is needed to destroy the superconducting state and make it
normal. This energy is called the energy gap.
In addition to having a linear momentum, each electron behaves as if it is
spinning. This property, not surprisingly, is called spin. (The electron is not
The BCS theory requires
actually spinning, but behaves as though it does.)
that the spins of Cooper pair electrons be in opposite directions.
Checkpoint 11.10
1
2
3
Medical applications
The first large-scale commercial application of superconductivity was in
magnetic resonance imaging (MRI). This is a non-intrusive medical imaging
technique that creates a two-dimensional picture of, for example, tumours and
other abnormalities within the body or brain. This requires a person to be placed
inside a large and uniform electromagnet with a high magnetic field. Although
normal electromagnets can be used for this purpose, their resistance would
11
Superconductivity
dissipate a great deal of heat and have large power requirements. Superconducting
magnets, on the other hand, have almost no power requirements apart from that
required for cooling. Once electrical current flows in the superconducting wire,
the power supply can be switched off because the wires can be formed into a loop
and the current will persist indefinitely, as long as the temperature is kept below
the transition temperature of the superconductor.
Superconductors can also be used to make a device known as a superconducting
quantum interference device (SQUID). This device is extremely sensitive to small
magnetic fields and can detect magnetic fields from the heart (1010 tesla) and
even the brain (1013 tesla). For comparison, the Earths magnetic field is about
104 tesla. As a result, SQUIDs are used in non-intrusive medical diagnostics of
the brain.
Process information to discuss
possible applications of
superconductivity and the
effects of those applications on
computers, generators and
motors and transmission of
electricity through power grids.
218
Scientific research
The traditional use of superconductors has been in scientific research requiring
high magnetic field electromagnets. One application of powerful superconducting
electromagnets is in high-energy particle accelerators, such as the Large Hadron
Collider at CERN (see section 15.4), in which beams of protons and other
particles are accelerated to almost the speed of light and made to collide with each
other to create more elementary particles. It is expected that this research will
answer fundamental questions such as those about the origin of the mass of matter
that makes up the universe.
A future use of superconducting electromagnets is in nuclear
fusion energy generation using plasmas. A plasma is a fully ionised gas
that is obtained by heating it to millions of degrees and trapping it
inside a toroidal structure known as a tokamak by large
electromagnets. The nuclei of the ions fuse together, producing
energy. The operating gases of such reactors are deuterium and
tritium, the isotopes of hydrogen. Deuterium is abundant in water,
but tritium will be made inside the tokamak as a by-product of fusion
reactions. There is no long-term radioactive waste with this process,
which is why it is known as clean nuclear energy. Currently an
international research plasma reactorthe International
Thermonuclear Experimental Reactor (ITER)is being built. The
aim of the project is to demonstrate that energy production is possible
with this method. A diagram of the projected reactor is shown in
Figure 11.11.2. The magnetic field coils on the ITER will be made
from superconductors and will need to only be powered once; the
current through them will be sustained indefinitely, as long as the coils
are kept cool.
Levitating trains
Magnetic levitation (maglev) trains have been built that use powerful electromagnets made from superconductors. The superconducting electromagnets are
mounted on the train and kept cool with liquid helium. As shown in Figure
11.11.3, normal electromagnets on a guideway beneath the train repel (or attract)
the superconducting electromagnets to levitate the train while pulling
it forwards. The superconducting electromagnets rely on the conventional
like-pole repulsionnot the Meissner effectto achieve levitation.
from ideas to
Implementation
Although such a superconducting maglev train has been built and has
demonstrated a top speed of 581kmh1, there are several issues that limit its
widespread commercial use. A strong magnetic field inside the train will exclude
passengers with pacemakers or devices that have magnetic data storage including
computers and credit cards. The powered conventional electromagnets on the
guideway that levitate and propel the train are expensive to run over long
distances, so alternative propulsion schemes may have to be used.
superconducting
levitation magnet
superconducting
propulsion
magnet
guideway
propulsion
magnet
vehicle
gliding
skid
guidance
and
braking
levitation
and
propulsion
magnet
armature
windings
(iron core)
guideway
PRACTICAL
EXPERIENCES
Activity 11.1
Checkpoint 11.11
1
2
219
11
Superconductivity
PRACTICAL EXPERIENCES
CHAPTER 11
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Perform an investigation to
demonstrate magnetic
levitation.
Analyse information to explain
why a magnet is able to hover
above a superconducting
material that has reached the
temperature at which it is
superconducting.
Gather and process information
to describe how superconductors
and the effects of magnetic
fields have been applied to
develop a maglev train.
Process information to discuss
possible applications of
superconductivity and the
effects of those applications on
computers, generators and
motors and transmission of
electricity through power grids.
220
Chapter summary
from ideas to
Implementation
Review questions
Physically Speaking
Reviewing
Concept
Definition
Superconductivity
Crystal lattice
Laue pattern
Phonon
Bragg law
Tc
Meissner effect
Type-II superconductor
Cooper pairs
221
11
Superconductivity
Solving Problems
19 Calculate the separation of diffraction grating slits
when a heliumneon laser (=633nm) is shone on
it. The angular position of the second-order bright
band is 30.
20
Resistance ()
1000
800
600
400
200
0
disadvantages of superconductors.
Re
222
iew
Q uesti o
50
100
150
Temperature (K)
Figure 11.12.2
200
250
from ideas to
Implementation
PHYSICS FOCUS
Semiconductors to
Superconductors
1950s
1960s
1970s
1980s
Silicon transistor
8-bit Microprocessor
32-bit Microprocessor
1 transistor
16 transistors
4500 transistors
275000
transistors
1990s
2000s
3100000
transistors
592000000
transistors
Figure 11.12.3 Moores law states that the number of transistors on a chip doubles every 2 years.
Since the introduction of the commercial transistor in
the late 1940s, the size and processing ability of
computer chips has changed dramatically. According
to Moores law, the number of transistors on a chip
doubles every 2 years. But there is a limitthe
physical size of atoms.
In the distant future, a new generation of
computers that make use of quantum physics may
overcome the size limitations of current (classical)
computer technology. Quantum computers allow for
processes to happen simultaneously rather than
sequentially as in classic computers. For example, to
factorise a 400 digit number would take a quantum
computer a few minutes, but it would take current
computers billions of years to do the same calculation.
Another distant technology that may speed up our
current limitations on computation speed are the
superconducting switching devices known as
Josephson junctions. Although not the only technology
being researched for future computers, this is one
to watch.
Research
10 Research and outline how the Josephson junction
works.
11 Research and describe the possible technologies
that can take over from semiconductors.
12 Outline the role of nano-carbon tubes in speeding
up processing ability.
223
Multiple choice
(1 mark each)
1 Predict the direction of the electron in Figure 11.13.1
as
A
B
C
D
II
III
Conductor
Insulator
Insulator
Semiconductor
Insulator
Conductor
Semiconductor
Conductor
Semiconductor
Semiconductor
Conductor
Insulator
A
B
C
D
224
Figure 11.13.1
A
B
C
D
II
III
Critical
temperature
Superconductor
material
Critical
temperature
Normal material
Superconductor
material
Critical
temperature
Normal material
Normal material
Superconductor
material
Figure 11.13.2
Normal material
Superconductor
material
Critical
temperature
II
II
Figure 11.13.3
III
Energy bands
Resistance ()
III
Temperature (K)
from ideas to
Implementation
4
III
C
D
10
11
12
a
b
II
A
B
Extended response
II
III
Striations
Faradays
dark space
Crookes
dark space
Cathode
Cathode
Striations
Anode
Cathode
Anode
Faradays
dark space
Striations
Faradays
dark space
225
4
Context
Figure 12.0.1
226
Quanta to
Quarks
We all talk about quantum leaps, but did you know that Max Planck made the first
quantum leap in 1901 when he introduced the idea of the quantisation of energy?
At the beginning of the 20th century, scientists were confronted by an accumulation
of experimental observations and explanations that lacked unification. Black body
radiation, the photoelectric effect, radioactivity and the emission of sharp spectral
lines by atoms in a gas discharge tube could not be adequately explained within the
framework of Newtonian classical physics.
A new physics, quantum physics, was born. The story has some inspiring
characters and storylines: how Niels Bohr synthesised the works of Planck, Einstein
and Rutherford and proposed the now commonly recognised RutherfordBohr
atomic model, and so provided an explanation for spectral lines; and how Louis de
Broglie in 1924 took Plancks idea, reversed it and proposed a totally crazy idea that
all matter has wave properties, which, in turn, gave birth to quantum mechanics.
The investigations into radioactivity led to the artificial manufacture of elements
and the dawn of the atomic age. The development of particle accelerators often
referred to as atom smashers in the media, led to the discovery of a particle zoo,
as a plethora of new subatomic particles were identified. Today, the quest to
understand the building blocks and forces of nature continues, with the building of
the Large Hadron Collider (referred to as the LHC), which is designed to produce
conditions that mimic the environment present just after the birth of the universe.
Inquiry activity
A chain reaction
Obtain some scrap paper! Gather together a crowd of peopleyour class will do
the more the better! And make sure you have safety glasses for all! Screw up the
paper into hundreds of balls, each about the size of a ping-pong ball. Make sure
everyone has at least six balls of paper and then gather everyone in close together.
These are the rules of the chain reaction game.
1 Safety glasses on at all times!
2 If you are hit by a paper ball you throw two balls of paper high up into the air.
Your teacher can lob in the first paper ball. It makes a great video clip!
You can also vary the rules. For example, try throwing one paper ball when you
are hit rather than two. Have fun!
227
12
From Rutherford
to Bohr
Pieces of a jigsaw!
Ernest
Rutherford
nce, a distinguished
stranger, amazed by
his unscholarly accent and
appearance, mistook him
for an Australian farmer.
At the University of
Manchester, Rutherford
would proclaim to his
recruits that: all science
is either physics
or stamp collecting.
Figure 12.1.1
228
quanta to
quarks
Table 12.1.1
Atomic timeline
Democritus
(c460 bcec.370 bce)
He was a Greek philosopher who proposed that there was a limit to how
small one could divide matterthe smallest indivisible particle was
called an atom (atomos, Greek meaning without slices or indivisible).
Aristotle
(384 bce322 bce)
John Dalton
(17661844)
Henri Becquerel
(18521908)
JJ Thomson
(18561940)
Ernest Rutherford
(18711937)
Niels Bohr
(18851962)
Louis de Broglie
(18921987)
James Chadwick
(18911974)
Experimental
surprise
Checkpoint 12.1
Outline the main atomic models proposed between ancient times and 1913.
~1010 m
electron
Figure 12.2.1
positively
charged
material
229
12
From Rutherford
to Bohr
Structure
Limitations
Despite the success of Rutherfords planetary atomic model in explaining the
scattering of alpha particles, the model failed to explain other important
questions:
What is the nucleus made of?
How are the orbits of the electrons arranged around the nucleus?
What keeps the negatively charged electrons from losing energy and spiralling
into the positive nucleus?
a
++
+ +
+
b
nucleus
Figure 12.2.2
Scattering deflections
predicted by (a) Thomsons
and (b) Rutherfords atomic
models
pa
rticle
b
metal
foil
particle
+
nucleus
Figure 12.2.3
230
quanta to
quarks
Checkpoint 12.2
1
2
Explain why Thomsons plum pudding model only predicted small deflections in alpha particles passing through
a thin metal foil.
Outline the significance of Marsden and Geigers scattering experiments to the development of the Rutherford
atomic model.
E = hf
Plancks assumption suggests that the energy of any vibration could only be
a whole number multiple of hf:
E = nhf
where n = 1, 2, 3, E is energy (in joules), n is called a quantum number, h is
Plancks constant (with a value of 6.63 1034 Js) and f is the frequency in
hertz (s1).
This would mean that energy was not a continuous quantity as had been
believed, but rather that it was quantised into discrete packets. Interestingly, at the
time of this work Planck considered this idea more of a mathematical device to
obtain the right answer rather than an actual physical reality of nature.
In 1905, Einstein extended the concept of quantisation and proposed a new
theory of light. Einstein took Plancks suggestion that the vibrational energy of
atoms or molecules in a radiating object was quantised with energy. He argued
that the vibrational energy of the atoms or molecules could only change by
a multiple of hf and therefore proposed that light would be emitted in discrete
packets (quanta) also obeying the equation E = hf.
Gilbert Lewis (18751946) in 1926 named these discrete packets of light
photons (from photos, Greek meaning light).
ramp
stairs
Figure 12.3.1
Worked example
Question
A photon has an energy of 2.8eV.
a Calculate its energy in joules.
Solution
a You will recall from section 10.2 that
1eV = 1.602 1019joules
b Now rearranging E = hf and substituting in the energy and Plancks constant we obtain:
f=
E 4.5 1019
=
= 6.8 1014 Hz
h 6.63 1034
231
12
From Rutherford
to Bohr
Quantised Energy?
Checkpoint 12.3
1
2
spectroscopes
When light is passed through a prism or diffraction grating, the constituent colours
present in the light are revealed in the spectrum produced. The prism separates
the light using the property of refraction, whereas the diffraction grating uses the
property of interference. The traditional device used by scientists to examine
spectra is called a spectroscope. Modern instruments are called spectrometers.
There are two types of spectra: absorption spectra and emission spectra.
Absorption spectra can be produced by passing white light (a continuous
spectrum) through a cool gas. The atoms or molecules in the gas will absorb
certain specific wavelengths (colours) of light. The atom that absorbed the
light is now in an excited state and will spontaneously emit a photon of light,
usually in a different direction. Therefore the original beam of light will now
have certain wavelengths depleted, and these will appear as a series of dark
lines when observed through a spectroscope.
Emission spectra can be produced when a gas is excited. This can be
achieved by heating the gas or by passing an electrical current through a low
pressure gas. The light produced when viewed with a spectroscope will often
be made up of a series of bright coloured lines.
Hydrogen absorption spectrum
source
collimator
slit
lens
grating
telescope
eye
Figure 12.4.1
232
A spectroscope showing
the major components
400 nm
Figure 12.4.2
700 nm
H alpha line
656 nm
Transition n = 3 to n = 2
quanta to
quarks
Ancient peoples have observed the emission colours of a range of materials.
A sprinkling of common salt (sodium chloride) into a flame produces an intense
golden flame, and during the smelting of copper ore the flames in the furnace are
often coloured an intense vivid green.
Spectra are a window into the hidden atomic structure. Each element has its
own unique spectral fingerprint. Chemists and students of chemistry still use
the flame test to identify the constituent elements present in chemical samples.
A continuous emission spectrum can be produced by a hot object, forming
a rainbow or continuum. For example, if you look at the light emitted from a
white-hot piece of metal through a prism or diffraction grating you will see a full
rainbow of colours.
Anders ngstrm (18141874) was the first to make detailed measurements
of the visible spectrum of hydrogen, and in 1885 Johann Balmer (18251898)
commenced a detailed study of the visible emission spectrum for
hydrogen, which is now referred to as the Balmer series.
Table 12.4.1 The Balmer series for hydrogen
Johannes Rydberg (18541919) generalised Balmers
Spectral line
Balmers generalised equation
equation:
1 1
1
=R 2 2
1
1
1
Name Colour
(nm)
n f ni
= R 2 2
nf ni
nf
ni
H
Red
Worked example
Question
Using the generalised form of Balmers equation, calculate the wavelength of the emitted
photon for a transition between n = 5 and n = 3.
Solution
Using the generalised form of Balmers equation to calculate
1
:
1 1
1
1 1
= R 2 2 = 1.097 107 2 2 = 7.80 105 m1
3 5
n f ni
Now take the inverse to determine the wavelength of the emitted photon:
= 1.28 106 m
656.3
486.1
434.0
410.2
n f ni
The test
of time
mee 4
8o2h 3c
233
12
From Rutherford
to Bohr
E=0
0.85
1.5
3.4
n=4
ionised atom
(continuous energy levels)
n=5
n=3
Paschen
series
n=2
excited
states
Balmer
series
Energy (eV)
10
Figure 12.4.3
13.6
ground state
n=1
15
Lyman
series
Spectral region
nf
ni
Lyman
Balmer
Paschen
Brackett
Pfund
Ultraviolet
Visible and UV
Infra-red
Infra-red
Infra-red
1
2
3
4
5
2,
3,
4,
5,
6,
Discovered
3,
4,
5,
6,
7,
4,
5,
6,
7,
8,
5
6
7
8
9
19061914
1885
1908
1922
1924
Worked example
Question
a Predict the values for nf and ni for the transition of the second longest wavelength
emission in the Lyman series.
b Calculate the wavelength for the second-longest wavelength in the Lyman series.
Solution
a The second-longest wavelength in the Lyman series will correspond to the secondlowest energy transition. In the Lyman series we know that nf is always 1. Thus
ni=2 will correspond to the lowest energy transition and ni = 3 will correspond to
the second-lowest energy transition. Therefore nf =1 and ni = 3.
1
b Using the generalised form of Balmers equation to calculate :
1 1
1
1
7 1
= R 2 2 = 1.097 10 2 2 = 9.75 106 m1
1 3
n n
f
Now take the inverse to determine the wavelength of the emitted photon for the
transition between ni = 3 and nf = 1.
= 1.03 107 m
234
quanta to
quarks
Worked example
Question
Using the information in the table, construct an energy level diagram for n=1 to n=5
for the hydrogen atom.
a On your diagram, draw and label the Lyman, Balmer and Paschen series.
b Identify the values of ni and nf for a transition of an electron between energy
levels that would absorb the highest frequency photon for the Paschen series,
based on the energy values provided.
Energy (eV)
1
2
3
4
5
13.6
3.4
1.51
0.85
0.54
c Identify the values of ni and nf for the transition of an electron between energy
levels that would emit the highest energy photon for the Balmer series.
Solution
a Refer to Figure 12.4.3 and construct a similar diagram for the given values n=1
to n=5.
b From your diagram for part a, we see that for the Paschen series ni = 3 to nf = 5 is
the greatest possible jump in energy, therefore this transition would absorb the
highest energy photon.
c From your diagram for part a, we see that for the Balmer series ni = 5 to nf = 2 is
the greatest possible jump in energy, therefore this transition would emit the
highest energy photon.
Checkpoint 12.4
1
2
3
4
5
6
Bohrs postulates
In 1913 Bohr announced the revised planetary model of a hydrogen atom based
upon the quantisation of energy and angular momentum of the electron. His
new RutherfordBohr model included a set of three postulates to address the
identified limitations of Rutherfords model.
Figure 12.5.1
Niels Bohr
235
12
From Rutherford
to Bohr
A Square peg
in a Round
Hole?
Checkpoint 12.5
1
2
236
This photon will have a characteristic frequency and wavelength, which can
be calculated using the relationships E = hf and v = f (where v is c the speed of
light) rearranged in the forms:
E
c
f =
and =
h
f
quanta to
quarks
Balmer series
n=1
nucleus
n=2
n=3
n=4
n=5
n=6
h nf ni
Dividing through by c we obtain a theoretically derived expression in a form
that is equivalent to the generalised Balmer equation:
1 E1 1
1
=
2 2
hc nf ni
n f ni
PRACTICAL
EXPERIENCES
Activity 12.1
237
12
From Rutherford
to Bohr
Worked example
Question
An electron makes a transition between the energy levels 0.85eV and 3.40eV in
a hydrogen atom.
a Determine to which spectral series this photon would belong.
b Calculate the wavelength of the emitted photon without using Balmers equation.
c Predict if the spectral line associated with this transition is visible to the
human eye.
d Evaluate the principal quantum numbers (n) to which each energy level
corresponds.
e Substitute your values of ni and nf into the generalised Balmer equation and
calculate the wavelength of the emitted photon. (This should be the same value
as in part b.)
Solution
a From Figure 12.4.3, the energy level 0.85 corresponds to n = 4 and energy level
and 3.40eV corresponds to n =2. The n = 2 tell us that this transition belongs to
the Balmer series.
b The difference between these two energies corresponds to the energy of the
emitted photon, E = 2.55eV. Convert this to the SI unit joules (remember
1eV = 1.6021019J).
E = 2.55 1.602 1019 = 4.091019 J
Now, using E = hf, we can calculate the frequency of the emitted photon with this
energy:
E 4.09 1019
f= =
= 6.17 1014 Hz
h 6.626 1034
c
Using the relationship v = f where v is c the speed of light, we have = .
f
c 3.0 108
7
= =
= 4.86 10 m
f 6.17 1014
1 1
1
1 1
= R 2 2 = 1.097 107 2 2 = 2 05 106 m1
2 4
n n
f
Now take the inverse to determine the wavelength of the emitted photon for the
transition between ni = 4 and nf = 2.
= 4.86 107 m
Checkpoint 12.6
Explain the relationships between Bohrs atomic model and observed line spectra.
238
quanta to
quarks
PRACTICAL
EXPERIENCES
Activity 12.2
Sr
Ba
Ca
400
Figure 12.7.1
450
500
550
600
Wavelength (nm)
650
700
Emission spectra for (a) stronium, (b) barium and (c) calcium
Figure 12.7.2
750
Measuring the
Magnetic field
of our Sun
239
12
From Rutherford
to Bohr
Universal units
for Length
and Time
The fine structure of closely spaced spectral lines that are 0.10.5 nm apart
is a result of an additional property of the electron called spin, which
was proposed in 1925 by Ralph Kronig (19041995). In 1881, using
interferometry, Albert Michelson (18521931) observed that some even
finer spectral lines (called hyperfine) existed; these were about 0.001nm
apart. The observation was not addressed until 1924, when Wolfgang Pauli
(19001958) proposed the existence of a small nuclear magnetic moment
Bohrs model did not
caused by a non-spherical atomic nucleus.
provide any explanation for these observed phenomena.
a
0.1 nm
0.1 nm
0.001 nm
Figure
12.7.3
0.001
nm
Normal spectral line:
Checkpoint 12.7
Compare the origins and observed separations of the splitting in the normal Zeeman effect and the hyperfine spectral lines.
240
PRACTICAL EXPERIENCES
quanta to
quarks
CHAPTER 12
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Figure 12.8.1
Perform a first-hand
investigation to observe the
visible components of the
hydrogen spectrum.
Process and present
diagrammatic information to
illustrate Bohrs explanation of
the Balmer series.
Solve problems and analyse
information using:
1
1
1
= R 2 2
n f ni
241
12
242
From Rutherford
to Bohr
Chapter summary
f
i
where is the wavelength of the spectral line
R is Rydbergs constant 1.097 107 m1
nf is the final state
ni is the initial state.
Hydrogen has several spectral line series including the
Lyman, Balmer, Paschen, Brackett and Pfund series.
Bohr in 1913 described the revised planetary model of
a hydrogen atom based upon the quantisation of energy
and angular momentum of the electron.
Bohrs three postulates for his atomic model:
1 Electrons exist in stable orbits called stationary
states.
2 Electrons absorb or emit specific quanta of energy
when they transition between stationary states
(orbits), described by the PlanckEinstein relation
hf = Ei Ef.
3 Angular momentum of electrons is quantised.
The RutherfordBohr model, despite its successes, was
nevertheless a hybrid model and many experimental
observations still remained unresolved including:
the spectra of larger atoms
the relative intensity of spectral lines
the existence of hyperfine spectral lines
aspects of the Zeeman effect.
Review questions
quanta to
quarks
Physically speaking
The following table is all jumbled up. Copy the table into your workbook and correctly arrange the information. You may also
add additional features associated with each atomic model. For each atomic model, draw and label a pictorial representation.
Democritus
c 400
b ce
Aristotle
c 300
b ce
Dalton
1801
Thomson
1904
Rutherford
1911
Bohr
1913
de Broglie
1924
243
12
From Rutherford
to Bohr
Reviewing
Solving Problems
useful to scientists.
n f ni
244
Energy (eV)
1
2
3
4
5
13.6
3.4
1.51
0.85
0.54
quanta to
quarks
a Identify the electron transition between the energy
levels in your diagram that would absorb the
highest frequency photon.
b Identify which transition of an electron between
energy levels would emit the highest energy
photon.
c Demonstrate on your energy level diagram how
the Balmer series of spectral emission lines is
produced.
d Identify the transition between energy levels in
the Balmer series that would produce the longest
wavelength photon.
Solve problems and analyse information using:
1
1
1
= R 2 2
n f ni
1
1
1
= R 2 + 2
nf ni
Re
iew
Q uesti o
245
12
From Rutherford
to Bohr
PHYSICS FOCUS
Quanta to Quarks Timeline
H1. Evaluates how major advances in
scientific understanding and technology
have changed the direction or nature of
scientific thinking
246
Figure 12.8.2
Extension
Some additional technologies you might research:
Methods of detecting the non-visible
electromagnetic spectrum, including infra-red,
ultraviolet light, X-rays and gamma rays
Types of magnets available
Electric and electronic circuitry
As you progress further through this module
extend and include more details on your timeline.
13
Birth of quantum
mechanics
A new paradigm
Niels Bohrs hybrid atomic model, which combined classical and
quantum concepts, held centre stage from 1913 through to the early
1920s. From a historical perspective we must remember that World
War I, 19141918, disrupted research laboratories throughout
Europe. The 1920s saw the move away from classical ideas and
culminated at the 1927 Solvay Conference in Copenhagen where
modern day quantum mechanics was born. In the years leading up
to this conference Louis de Broglie proposed that all matter has wave
properties, Erwin Schrodinger developed a mathematical model for
describing wave mechanics, Heisenberg revealed the statistical
nature of quantum theory and formulated his most famous
discoverythe Heisenberg uncertainty principle
and Pauli developed his exclusion principle.
Checkpoint 13.1
Outline how quantum mechanics was started.
A Great Loss
247
13
Birth of quantum
mechanics
Figure 13.1.1
Matter waves
In 1923, Louis de Broglie began with a supposition that was based upon the
PlanckEinstein equation linking energy quanta to frequency. He equated
Einsteins special relativity energymass relationship E=mc2 with the Planck
Einstein equation E=hf:
mc2 = hf
He rearranged it to derive an expression for the momentum of a photon:
mc =
hf
c
hf
c
c
c
Using the relationship =
in the form f = and substituting,
f
he obtained:
h
p=
248
quanta to
quarks
He had derived an expression for the momentum of a photon in terms of its
wavelength.
In 1924, de Broglie proposed the concept of matter waves by applying
his belief in symmetry to propose that all particles of energy should also possess
an associated wavelength. At this time, particles of matter such as electrons,
atoms and alpha particles were known to possess the properties of mass and
charge, but there was no experimental evidence to indicate that these particles
exhibited wave phenomena.
De Broglies proposed matter waves could be calculated mathematically by
using what is now known as the de Broglie equation:
=
h
h
or =
p
mv
Figure 13.2.1
Louis de Broglie
Worked example
Question
a Calculate the wavelength of an electron moving with a velocity of 6.0 105 ms1.
b Compare this calculated wavelength with the wavelength of visible light
(400650nm).
Solution
h
and substituting in the following values:
mv
h= 6.63 1034 Js
me = 9.11 1031 kg
we obtain
6.63 1034 J s
9.11 10
31
kg 6.0 10 m s
= 1.21 109 m
b The visible spectrum ranges from violet light, with the shortest wavelength
(~400nm), to red with a wavelength of 650nm. For the purpose of this comparison
we will use the average wavelength of these extremes 525nm, which corresponds to
a yellow/green colour.
First we need to convert 525nm into metres (5.25 107 m). Now, comparing the
ratio of the wavelength of the electron (1.21 109 m) with the average wavelength
of visible light (5.25 107 m), we discover that the wavelength for an electron
travelling at 6.0 105 ms1 is approximately 430 times smaller than the
wavelength of visible light.
249
13
Birth of quantum
mechanics
Checkpoint 13.2
1
2
3
Outline the significance of the work of the Braggs in Louis de Broglie developing his theorical research.
Determine the wavelength of an electron moving at 3.0 105ms1.
Calculate the momentum of a proton that has a wavelength of 1.01109m.
13.3 Diffraction
Define diffraction and identify
that interference occurs
between waves that have been
diffracted.
250
quanta to
quarks
a
Figure 13.3.1
Light passing (a) around a razor blade, (b) through a slit and (c) through a small aperture
You cant
escape them
Try This!
Observing a diffraction pattern
Simply take two pens or pencils (you can even use
your fingers). Place them very close together, almost
touching, to form a narrow slit. Now hold them up in
front of one eye (about 34cm in front works well) now
close the other eye. Look through the slit at a bright light
source such a fluorescent light. Vary the width of the slit
and observe carefully! What do you see?
Checkpoint 13.3
1
2
Define diffraction.
Sketch a diffraction pattern that would result as light moves around a hair.
251
13
Birth of quantum
mechanics
In the family
Up until this time Davisson and Germer had been unaware of de Broglies
hypothesis, but during a meeting at Oxford in 1926 Davisson heard of
de Broglies proposed matter waves. He immediately realised that the new results
he and Germer had observed were similar to those of X-ray diffraction patterns.
With this insight, Davisson and Germer quickly identified that the scattered
electrons were indeed producing diffraction patterns.
At the same time, George Thomson (18921975) conducted a different
experiment in which he fired a beam of electrons through a thin gold foil and
observed diffraction rings. Therefore, in 1927, two different research teams using
different experimental techniques had verified Louis de Broglies hypothesis
concerning the wave nature of electrons.
a
b I
V = 54 V
electron
gun
50
0
power supply
15 30 45 60 75 90
electron
detector
incident waves
in phase
50
nickel crystal
Figure 13.4.1
scattered waves
in phase
GP Thomson
crystal
surface
Figure 13.4.2
Davisson and Germers apparatus and results. With the single crystal nickel
target, Davisson and Germer detected several peaks in scattered electrons.
Checkpoint 13.4
1
2
252
Bohrs first postulate stated that an electron can exist in any of several circular
orbits with no emission of radiation. However, he provided no supporting
reasoning to explain why the orbiting electron would not radiate away its energy,
as predicted by Maxwells classical electromagnetic theory, and simply spiral into
the nucleus.
When de Broglie formulated his concept of matter waves, he envisioned that
electron orbits were standing waves. Each orbit (stationary state) was occupied
by an integral number of wavelengths that fitted around the circumference of
the orbit.
The circumference of a circular orbit is 2R and hence an integral number
of wavelengths:
n = 2R
quanta to
quarks
where n is 1, 2, 3, 4, 5 , is the wavelength and R is the radius of the orbit.
To visualise these standing waves we first imagine a standing wave on a
slinky spring or rope, obeying the condition n where n can have the values n =
1, 2, 3 etc. When you shake a rope or a slinky spring you can produce standing
wave patterns that contain whole or integral numbers of wavelengths.
Now we can take these standing waves and wrap them end-to-end to form a
circular standing wave that corresponds to the circular Bohr orbit. These values
of n are called the principal quantum numbers and they tell you the number of
wavelengths fitting into the Bohr orbit.
Remember that the standing waves in Figure 13.5.1 represent the matter
wave nature of the electron. Often students imagine the electrons travelling in
wavy orbital paths around the nucleusthis is not the correct interpretation.
n=1
Try This!
Standing waves
Using a slinky spring or a rope, generate standing waves by having one
person hold one end firmly while the other person shakes their end.
Once you get a stationary wave you will notice that if you stop shaking,
the standing wave is quite stable and continues to vibrate. On a rope or
slinky you can generate standing waves other than those shown in
Figure 13.5.1. Why are these other standing wave patterns not suitable
for forming circular standing waves? (Hint: Consider interference.)
n=2
Figure 13.5.1
n=3
n=4
Checkpoint 13.5
1
2
Outline how de Broglies matter waves help to explain the stable orbits of electrons.
Calculate the radius of the smallest orbit of an electron with wavelength 1.21 109m.
13
Birth of quantum
mechanics
h2 d2
+ V ( x ) = E
2m dx 2
When solved, the equation provides a wave function. The wave function
contains all the measurable information about a particle. If you square this
function you obtain a probability density, which allows you to predict the
likelihood of finding a particle. This very quickly successfully addressed a large
variety of atomic and molecular problems. The price paid for this new way of
looking at matter was that the classical and deterministic view of the atomic
world was totally replaced by a system in which you could only calculate the
probability of finding a particle in a certain place and time.
Erwin Schrodinger
A famous cat
and inspiring a
new career path
to DNA
254
Heisenberg met Bohr in 1922, he was just 20 years old and working toward his
PhD. At the end of a lecture Heisenberg raised an objection to what Bohr had
been discussing. Bohr was so impressed that he invited Heisenberg on a walk and
they discussed the difficulties of quantum theory for more than 3 hours. Bohr
liked physical models of atoms, but Heisenberg thought it was nonsense to talk
about electron orbits when it was obvious that you could not observe the
electrons or their orbital paths.
Heisenberg went on to discover the first complete version of quantum
mechanics. The new theory that was developed in parallel with Schrodinger had
taken a completely different path. Heisenbergs theory had abandoned any
attempt to create a visualisation of the atom, ignored the ideas of waves and
particles, and in itself was purely mathematical.
Heisenberg took de Broglies proposal and Schrodingers wave equation and
was able to ascertain that there was a limit to how precisely you would measure
pairs of quantities. The pair he identified was that of position and momentum.
In 1927 Heisenberg announced his uncertainty principle, which
is expressed mathematically as:
h
h
x p
and E t
2
2
where x is the uncertainty in the position of a particle, p is the uncertainty in
the momentum of a particle and h is Plancks constant.
You can gain some insight into understanding the uncertainty principle by
considering the following example. If you want to know the position of a
particle, you will need to use a high-energy photon that has a small wavelength
as a high-resolution probe to locate your particle. Now consider that the photon
bounces nicely off the particle and you detect the photon and calculate the
position of your particle. You have located the particle very precisely, but the
particle is now recoiling and you will have very little information about its
momentum.
An even more bizarre implication of Heisenbergs uncertainty principle
(pointed out by Einstein) is that particles (i.e. energy) can appear out of the
nothingness of space for a very brief period of time, and then disappear.
quanta to
quarks
These particles are called virtual particles and we will discuss some aspects of them
in Chapter 15 The particle zoo.
In summary, Heisenbergs contributions to atomic theory include:
1 the development of his matrix mechanics, which, like Schrodingers wave
equation, allowed the nature of the atom and especially spectra to be explained
2 the uncertainty principle, which set limits on the precision of measurements.
Checkpoint 13.6
1
2
3
Figure 13.6.2
Werner Heisenberg
Figure 13.6.3
Wolfgang Pauli
255
13
Birth of quantum
mechanics
PRACTICAL EXPERIENCES
CHAPTER 13
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Chapter summary
256
Review questions
quanta to
quarks
Physically speaking
Copy and complete the following text.
______________ atomic model published in ______________ was a mix of
______________ and ______________ concepts. In ______________ Louis de Broglie
proposed that particles had both ______________ and ______________ properties.
______________ developed the idea of ______________ waves, which, when applied
to the situation of an electron ______________ in Bohrs atomic model, provided
mv
1
E1 where E1=13.6eV
n2
to calculate the ______________ of the electron in a given orbit (n). You can then
rearrange the relation E=hf to calculated the frequency. Substituting the
frequency into = c/f you can calculate the ______________ associated with the
electron for the orbit.
You calculate the wavelength for n=1 and the first orbit (n=1) has a
circumference equal to one ______________. You recalculate the wavelength for
n=2 and the second orbit (n=2) has a ______________ equal to ______________
wavelengths.
In 1927 ______________ and ______________ fired electrons at a ______________
target and observed ______________ phenomena that verified the ______________
______________ duality of electrons.
Reviewing
1
2
3
4
5
Define diffraction.
Describe how interference can be achieved.
Recall instances in which diffraction occurs.
Outline an experiment you could conduct to observe interference.
In terms of the Bohr atomic model, explain how matter waves were used to
account for the stability of electron stationary states.
7 Louis de Broglie proposed that any kind of particle has both wave and
particle properties. Recall the response given by the scientific community.
of atomic theory.
257
13
Birth of quantum
mechanics
Solving Problems
Solve problems and analyse
information using:
h
=
mv
mv
a Recall what each term represents.
b Calculate the wavelength of an electron in a television tube with
a velocity of 3.0107 ms1.
Re
258
iew
Q uesti o
quanta to
quarks
PHYSICS FOCUS
The Microscope:
How small can we see?
H3. Assesses the impact
of particular advances in
physics on the development
of technologies
Describe the impact of de Broglies
proposal that any kind of particle has
both wave and particle properties.
Extension
5 Outline the different ways in which specimens are
prepared if they are to be examined with an
electron microscope.
6 Discuss why the way in which most specimens
need to be prepared differs from they way they are
prepared when we use an optical light microscope.
7 Investigate the advantages and disadvantages of
using an electron microscope.
259
14
neutron, beta decay, atomic mass,
nucleons, strong nuclear force,
atomic number, nuclides,
transmutation, alpha decay, decay
series, atomic mass unit, fission,
nuclear reactor, controlled nuclear
reaction, atomic pile, critical mass,
uncontrolled nuclear chain reaction,
neutron scattering
20th century
alchemists
Base metals to gold
The idea of changing base metals into gold has been an age-old
quest of alchemists. Their attempts were not successful because
all the types of reactions the alchemists performed were merely
chemical reactions. To achieve their dream, they needed to
change the identity of the atom by changing the number of
protons in the nucleus. In this chapter we will trace the history
of nuclear physics, the modern alchemy, from the early 1930s
through to the construction of the first man-made
nuclear reactor.
Figure 14.1.1
260
James Chadwick
Rutherford proposed the idea of the neutron in 1920. At the time he considered
the neutron to be a system comprising a proton and an electron tightly bound
together. This system provided an explanation for the ejection of electrons from
the nucleus during beta decay. James Chadwick (18911974) set to work to
find the neutron.
Chadwick set up two main experiments in which he fired alpha particles at
a beryllium target. He allowed the unknown radiation generated in the first
experiment to pass through paraffin blocks and in the second experiment to pass
through nitrogen gas.
He then applied the conservation laws of momentum and energy, by setting
up simultaneous equations that involved the known masses and velocity
measurements of the alpha particle and ejected nuclei from both experiments.
Chadwick solved these equations in order to determine the likely mass of the
unknown radiation. A month after commencing these experiments, Chadwick
submitted for publication a paper entitled Possible existence of a neutron in
which he reported the neutron to be just slightly more massive than a proton.
quanta to
quarks
Property
Proton
Neutron
Charge
Mass
+1.6 1019 C
1.673 1027 kg
Neutral
1.675 1027 kg
With the discovery of the neutron, the nucleus of the atom was considered to
consist of protons and neutrons; the number of protons equalling the number
of electrons in the neutral atom. The number of neutrons could vary to account
for the observed differences in the atomic mass of the atom. The proton and the
neutron are referred to as nucleons, when they are in a nucleus.
Be
foil
particles
Figure 14.1.2
Be
foil
paraffin
protons
(hydrogen
nuclei)
N2 gas
nitrogen
nuclei
particles
Checkpoint 14.1
1
2
3
Repulsive Protons
261
14
20th century
alchemists
ractiv
Clearly, the force of gravity was too weak to overcome the huge electrostatic
repulsive force produced by the interaction between protonsa third
fundamental force of nature was required to explain the stability of the nucleus.
Experiments over the period from 1930 to 1950 showed that this new strong
nuclear force possessed the following properties.
1 The strong nuclear force does not depend on the charge, therefore all
nucleons (protons and neutrons) bind together with the same force. This is
supported by evidence that protons and neutrons are equally likely to be
ejected from a nucleus in a collision.
2 The strong nuclear force acts over short distances of about 1 1015m, the
diameter of a nucleus, and within this range the force is much stronger than
the electrostatic forces. The evidence supporting this is that otherwise the
nucleus would have a tendency to attract more protons and neutrons.
3 The strong nuclear force between the nucleons acts only between adjacent
nucleons. The evidence supporting this is based upon the observed stability
of the nucleus.
nte
M o d u le
repulsion
repulsion
Figure 14.2.1 shows the forces between two nucleons, and from the graph
you can see that the strong nuclear force is at a maximum at a distance of
approximately 1.3 1015 m. If the nucleons are less than 0.5 1015 m apart,
a repulsive force is present.
If we consider the resultant force acting on a proton or an alpha particle
approaching a nucleus, we see that the Coulomb repulsion increases and then
sharply decreases once the particle is within the range of the strong nuclear force
(see Figure 14.2.2).
Coulomb repulsion
Figure 14.2.1
Separation
of nucleons
1015 m
Force
attraction
attraction
Force
range of the
nuclear force
Distance
from nucleus
nuclear attraction
Figure 14.2.2
Checkpoint 14.2
1
2
What is the difference in the magnitude of the electrostatic and gravitational forces between protons in the nucleus?
List the properties of the strong nuclear force.
262
quanta to
quarks
The number of neutrons (N) can be calculated by subtracting the atomic
number (Z) from the mass number (A).
N=AZ
The nuclear structure of an element is represented by ZA X where X is the
element symbol.
Atoms of each element have the same number of protons in their nuclei,
but the number of neutrons can vary, and these atoms are called isotopes.
There are 91 naturally occurring elements and we have identified more than
2500 different nuclei (or nuclides) associated with these. Of these isotopes,
90% are unstable and often quickly decay into other nuclei.
Checkpoint 14.3
Copy and complete the following table to identify the number nucleons and electrons in elements.
Element
Mass number
Atomic number
Protons
Neutrons
Electrons
C
He
F
Xe
14.4 Transmutation
Artificial transmutation
The process of changing one element into another is called transmutation.
Transmutation occurs naturally in stars through the process of nuclear fusion and
here on Earth via natural radioactivity by which certain elements decay spontaneously.
In 1919, Rutherford fired alpha particles into nitrogen gas and detected
a highly energetic particle that he identified to be a proton. This was actually the
first artificially induced transmutationthe alpha particle had collided with the
nitrogen nucleus to produce an oxygen nucleus and a highly energetic proton.
Rutherford initially detected the production of very high energy protons and,
after further experiments in which he used a cloud chamber, he identified that this
was not just a simple collision. Rather his calculations indicated that the nitrogen
nucleus had absorbed the alpha particle and then ejected a high-energy proton.
Rutherford then showed that the new nucleus was oxygen. He had changed
nitrogen-14 into oxygen-17, as prescribed by this equation:
4
2
He +
14
7
N 178 O + 11 H
Natural transmutation
Today we know that nuclei with more than 83 protons (Z > 83) or atomic mass
numbers greater than 209 (A > 209) are unstable and decay. In these atoms, the
repulsive electrical forces between the protons overcome the strong nuclear force.
We also know that instability due to odd number pairing of protons and neutrons
The two most common natural decay processes
can occur in smaller nuclei.
are the emission of alpha and beta particles.
Describe nuclear
transmutations due to
natural radioactivity.
263
14
20th century
alchemists
N
238U
146
4.47 109 y
145
234Th
144
24.10 d
143
142
141
140
138
1600 y
137
136
134
131
214Pb
27 min
129
Tl
19.9 min
214Po
decay
decay
1.637 104 s
1.30 min
127
214Bi
19.9 min
130210
128
218Po
3.10 min
210Pb 22.3 y
210Bi
5.01 d
126
125
210Po
138.38 d
206Pb
124
80 81 82 83 84 85 86 87 88 89 90 91 92 93 Z
Hg Tl Pb Bi Po At Rn Fr Ra Ac Th Pa U Np
Figure 14.4.1
Alpha-particle emission usually occurs with large unstable nuclei. An alpha particle
is identical to a helium nucleus 42 He . After the nucleus has emitted the alpha
particle, the atomic mass A decreases by four, corresponding to the loss of two
protons and two neutrons, the atomic number Z decreases by two, corresponding
to the loss of two protons, and the number of neutrons N decreases by two,
corresponding to the loss of two neutrons.
The general equation for this process is:
A
A4
4
AZ 42Y + or more formally Z X Z 2Y + 2 He
X represents the nuclei of the parent element and Y represents those of the
daughter element.
A common naturally occurring alpha decay is that of uranium-238 decaying
to thorium-234:
A
ZX
226Ra
222Rn
3.82 d
135
132
230Th
7.54 104 y
139
133
234Pa
70 s 234U
5
2.46 10 y
Alpha decay
238
92 U
234
90Th
238
92 U
+ or more formally
234
90Th
+ 42 He
Beta decay
There are three different types of beta-decay processes: beta-minus (), beta-plus
(+) and electron capture.
The Unseen
264
A
Z +1Y
+ + e or more formally
A
ZX
A
Z +1Y
0
1 e
+ 00 e
The atomic mass Z remains the same, as the total number of nucleons is unchanged.
The atomic number A increases by one and the number of neutrons N decreases by
one, corresponding to the transformation of a neutron into a proton.
A commonly occurring beta-minus decay is thorium-234 to protactinium-234:
234
90Th
234
91 Pa
+ + e or again formally
234
90Th
234
91 Pa
0
1 e
+ 00 e
A
Z 1Y
+ + + e
or more formally
A
ZX
A
Z 1Y
0
+1 e
+ e
19
10 Ne
199 F +
0
+1 e
+ 00 e
quanta to
quarks
A
Electron capture
238
234
+ e 73 Li + e
The three types of beta decay are mediated by the fourth force of
nature called the weak nuclear force, which was proposed by Enrico
Fermi in 1934 to explain beta decay.
As Rutherford and Soddys 1903 paper stated, often a series
of decays occurs. For example, in nature the decay series for
urainium-238 is common. Figure 14.4.1, a Segr chart, and Figure
14.4.2, a plot of atomic mass against atomic number, both show the
decay series for uranium-238 ( 238
92 U ), which undergoes eight alpha
decays and six beta decays, terminating at the stable isotope
). The original nucleus for each transition is called
lead-206 ( 206
82 Pb
the parent and the resultant nucleus is called the daughter.
Gamma radiation
After the emission of an alpha or beta particle, the daughter nucleus
is sometimes left in an excited state. The change in energy from this
excited energy state of the nucleus to the ground state of the nucleus
will result in the emission of a very high energy photon called a
gamma ray and is represented by the Greek letter . Gamma
radiation does not change the atomic number (Z) or the atomic
mass (A) and therefore it is not an example of transmutation.
A general equation for gamma emission can be written as:
24 d 6.7 h
2.5 x
decay
105
7.5 x 104 y
226Ra
226
1600 y
222Rn
222
3.8 d
218
3.1 min
214
218At
218Po
214Pb
27 min
20 min
218Rn
1.6 s 0.04 s
214Bi 214
Po
20 min
1.6 x 104 s
210Tl 210
Pb 210Bi 210Po
210 1.3
22 y 5 d
min
5 d 138 d
206Tl
206Pb
206 4.2
81min82
Tl
Pb
83
Bi
84
Po
Figure 14.4.2
85
At
86
Rn
87
Fr
88
Ra
89
Ac
90
Th
91
Pa
92 Z
U
PRACTICAL
EXPERIENCES
Activity 14.1
nte
ractiv
M o d u le
Checkpoint 14.4
Define a transmutation and outline how Rutherford made this happen.
Identify how Rutherford showed that the high-energy protons were not the result of a collision.
List the properties of an atom that will naturally transmute.
List the types of decay that can occur to help stabilise a nucleus and the properties each possess.
1
2
3
4
234U
230Th
230
A
ZX
+
The * denotes the nucleus is in an excited state.
234Pa
234Th
decay
A *
ZX
4.5 x 109 y
17
4 Be
238U
9
8
7
6
5
4
3
2
1
0
1 2 3 4 5 6 7 8 9 10 11 12
Kinetic energy of the -particles (MeV)
Figure 14.5.1
14
20th century
alchemists
value that was dictated by the parent nuclei. Refer to Figure 14.5.1 for a typical
energy distribution for beta particles.
In 1930, Wolfgang Pauli (19001958), in a letter addressed to
LiseMeitner and Hans Geiger, proposed that a new particle was also emitted
during the beta-decay process. Pauli proposed that this new particle had no
charge and would only very weakly interact with matter. This newly proposed
particle would allow the distribution of energies to be explained. The energy
released during a decay could be shared between the beta particle and the
neutrino in any ratio; thus, the spectrum of energies observed could be explained.
Enrico Fermi formally proposed a theory for beta decay whereby a neutron
in the nucleus of an atom was transformed into a proton, an electron and an
antineutrino. His theory also proposed a fourth force of nature, the weak nuclear
force (commonly referred to as the weak force). The scientific community
accepted the theory, even though Reines and Cowan did not experimentally
detect the neutrino until 1953.
Checkpoint 14.5
Worked example
Question
a Convert 4.05 amu to kilograms.
b Convert 5.023 1027 kg to amu.
Solution
a 4.05 1.6605 1027 = 6.73 1027 kg
b
5.023 1027
1.6605 1027
= 3.025 amu
quanta to
quarks
Worked example
Question
Calculate the equivalent energy contained in 0.0015amu.
Solution
0.0015 931.5 = 1.4 MeV
Mass defect
The total mass of a stable nucleus is less than the total sum of the masses
of the protons and neutrons it contains. How can this be? Einstein linked mass
to energy in his equation E = mc2. In our case, when a nucleus forms, some
energy is radiated away. This loss of energy reduces the mass of the nucleus.
Conversely, when the unstable nucleus of a large atom splits into two or more
fragments, the combined mass of the daughter nuclei (and possible neutrons) is
less than the mass of the parent nucleus. This loss of mass is caused by some of
the original parent mass being transformed into the kinetic energy of the
fragments and also possibly gamma radiation.
The difference in mass between the total original mass and the total final
mass is called the mass defect.
Worked example
Question
Calculate the mass defect for a helium-4 atom ( 42 He), given the rest mass of a helium
atom is 4.002602amu.
Solution
A helium-4 atom consists of a nucleus (2 protons and 2 neutrons) and 2 electrons.
Calculate the total mass of these constituent particles:
Calculate the difference in masses by subtracting the total mass of the constituent
particles from the actual mass of the atom:
4.032980 amu 4.002602 amu = 0.030378 amu
Therefore the mass defect for the helium-4 atom is 0.030378 amu.
Figure 14.6.1
Checkpoint 14.6
1
2
3
4
State what amu stands for and give reasons for its use.
Convert 3.1amu to kilograms.
Calculate the number of MeV from 0.3amu.
Define mass defect.
267
14
20th century
alchemists
Worked example
Question
Calculate the binding energy for the helium-4 ( 42 He) nucleus.
Solution
In our previous calculation we determined the mass defect for the helium-4 atom ( 42 He) to
be 0.030378 amu.
Using Einsteins equivalence between mass and energy we simply convert the mass
defect into the units of energy by using the relationship 1amu = 931.5MeV.
The binding energy for helium-4 nucleus ( 42 He) = 0.030378 931.5 = 28.3MeV.
To investigate the stability of various nuclei, the binding energy (Eb) per
nucleon can be calculated by dividing the binding energy by the total number
of nucleons (protons and neutrons) in the nucleus.
E
binding energy
Binding energy per nucleon =
= b
atomic mass number A
Worked example
Question
Calculate the binding energy per nucleon for helium-4 nuclei ( 42 He).
Solution
In our previous calculation we determined the binding energy for the helium-4 atom ( 42 He)
to be 28.3MeV. We know that the atomic mass number for helium-4 is 4.
E
binding energy
Therefore:
Binding energy per nucleon =
= b
atomic mass number A
28.3 MeV
= 7.075 MeV
4
We can see from Figure 14.7.1 that elements with atomic mass numbers
between 40 and 80 have nuclei that are tightly bound. The binding energy of
elements with atomic mass number greater than 80 is slightly less, and the
binding energy of elements with atomic mass number less than 40 decreases
sharply. You will notice that 42 He has quite a high binding energy per nucleon,
which makes it very stable and explains why alpha particles 42 He rather than
single protons are ejected from nuclei in alpha decay.
The graph also shows that if you fuse light nuclei together you would
increase the binding energy per nucleon, thus energy would be released. This
process, called fusion, is the nuclear reaction that provides the energy source for
stars. You can see that if the nucleus of a heavy atoms splits, the binding energy
per nucleon will increase. This process is called fission, and is the energy source
used in nuclear reactors.
=
268
quanta to
quarks
9
20
16O Ne
12C
4He
Ca Fe
Kr
Hg
greatest
stability
fission
6
5
4
3
fusion
2
1
0
0
Figure 14.7.1
50
100
150
Mass number (A)
200
Checkpoint 14.7
1 Define binding energy.
2 Referring to the graph (Figure 14.7.1), explain how binding energy can help you understand when energy is
released or absorbed.
269
14
20th century
alchemists
Checkpoint 14.8
1 Identify what a transuranic particle is.
2 Outline the experiment performed by Fermi which resulted in nuclear fission and not the production of
transnuranic elements.
Earths Own
Fission Nuclear
Reactor
270
U+n
239
92
90
36
Kr + 146
Ba + 3n
56
The scientists working on this research realised that this chain reaction if
controlled could be used as a useful power source, but if it was allowed to
proceed uncontrolled, it could produce a huge explosion. In August 1939, war
with Germany seemed imminent and the American-based scientists felt that
America should investigate the possibility of building a bomb. Leo Szilard,
Eugene Wigner and Edward Teller drafted a letter that was signed by Einstein
and sent to the US President Franklin D Roosevelt.
It was known that 238
92 U would not support a chain reaction because the
neutrons released during the fission process did not have the right energy to
cause other 238
92 U nuclei to split. However, the rarer naturally occurring isotope
235
acted differently and was known to fission when struck by a slow neutron.
92 U
In 1935 Enrico Fermi and his team set about designing and carrying out an
experiment to see if a controlled nuclear reaction was possible. In 1942 Fermis
team designed and commenced building a fission reactor in a squash court at
Chicago University. Some 40000 graphite bricks, weighing 350 tonnes, were
used as a moderator to slow down the high-energy neutrons. They also provided
the reactor with a structural component that gave the reactor its name, the pile.
Natural uranium in the form of uranium oxide, which contains about 0.7%
235
, provided the fuel for this reactor, and a set of control rods made from
92 U
sheets of cadmium nailed to sticks of timber acted as neutron sponges. These
could be withdrawn or inserted into the pile to control the number of neutrons
and thus the rate of the fission reactions.
The atomic pile took 6 weeks to construct, and in early December 1942 the
control rods were withdrawn 15cm at a time. With each withdrawal the neutron
counters would climb and level off. At 2:00pm, Fermi announced that, based
upon his calculations, the next withdrawal would result in a self-sustaining
reaction. As predicted, the neutron intensity increased more and more rapidly.
1
After 4 2 minutes Fermi raised his hand and announced: The pile has gone
critical. The control rods where pushed back in and the reactor shut down.
quanta to
quarks
Figure 14.9.1
The conditions for a controlled nuclear chain reaction are such that the
available neutrons, which cause the fission, are regulated. The control
mechanisms in a nuclear reactor include the use of neutron-absorbing materials
within the physical structure and the manipulation of control rods.
In an uncontrolled nuclear chain reaction, such as in a nuclear
reactor meltdown or in a fission atomic bomb, the production of neutrons goes
unchecked and the fission reactions increase at an accelerating rate. This process
releases an enormous amount of energy in a very short period of time and
results in a significant explosion.
Checkpoint 14.9
1 Explain how a chain reaction can be sustained in nuclear fission.
2 Outline Fermis experiment to show nuclear fission.
3 Give reasons why artificial nuclear fission is a desired process.
271
14
20th century
alchemists
The discovery of the neutron and the further development of nuclear reactors
provided scientists and industry with a powerful analysis tool. Chadwicks work
had revealed that neutrons were subatomic particles with a mass approximately
the same as that of the proton, and no net electric charge. From de Broglies
work we also know that neutrons will exhibit wave properties and their de Broglie
wavelength will vary, depending on their velocity. Interestingly, although
neutrons have no net charge, it was discovered they do have the property of spin;
they therefore possess a magnetic property and act like little subatomic magnets.
These properties make the neutron an excellent tool with which to probe aspects
of atomic structure.
There are four main applications of neutron scattering:
Spacing: The de Broglie wavelength associated with slow (thermal) neutrons
allows investigations to determine the structure and spacing between atoms
in solids and liquids.
Motion: The energy of slow (thermal) neutrons is similar to the energies
associated with the movements of atoms in solids and liquids, allowing
researchers to investigate atomic vibrations and the forces between atoms.
Magnetic structure: The neutron acts like a subatomic magnet, which allows
researchers to examine at the atomic level the structures of semiconductors
and magnetic materials.
Inner structure: X-rays and electrons are scattered by atomic electrons, but
neutrons are scattered by atomic nuclei and, unlike alpha particles, are not
influenced by electrostatic forces. This allows neutrons to penetrate dense
materials to depths of several centimetres, therefore making it possible for
researchers to study material deep inside large pieces of equipment (such as
aircraft engines), and inside containment vessels that have varying conditions
of pressure, temperature and environment.
The first neutron-scattering experiments were carried out by Ernest Wollan
and Clifford Shull in 1945 using another graphite pile reactor built at Oak Ridge.
Today neutron diffraction is used in combination with X-rays to investigate the
structures and properties of a wide range of materials.
The Bragg Institute at the Australian Nuclear Science and Technology
Organisation (ANSTO) OPAL reactor facility, located just south of Sydney,
currently houses eight neutron-beam instruments. The neutron-scattering and
X-ray techniques are used to solve complex research and industrial problems
across a wide range of fields including engineering, pharmaceuticals, mining,
plastics and biology.
Checkpoint 14.10
1 Outline how neutrons can be used to probe atomic structure.
2 List applications of neutron scattering.
272
PRACTICAL EXPERIENCES
quanta to
quarks
CHAPTER 14
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Perform a first-hand
investigation or gather
secondary information to
observe radiation emitted
from a nucleus using the
Wilson Cloud Chamber or
similar detection device.
273
14
274
20th century
alchemists
Chapter summary
Review questions
quanta to
quarks
Physically speaking
Pretend you are Charles Wilson, the inventor of
the cloud chamber, shown in Figure 14.11.1.
Name the components of your new invention.
Prepare a short scientific report on your new invention
for presentation to your class.
F
D
Reviewing
Fission
the neutron.
Fusion
Alpha emission
PROPERTIES
Neutron
Beta emission
Proton
Symbol
Charge
Mass (kg)
275
14
20th century
alchemists
Scientist(s)
1920
1930
1932
Contribution
1934
Experiment
______________
repulsion
Force
Coulomb repulsion
range of the
nuclear force
distance
from
nucleus
KE of beta particles
Figure 14.11.3
attraction
Figure 14.11.2
Discuss Paulis suggestion of the existence
of the neutrino and relate it to the need to
account for the energy distribution of electrons
emitted in -decay.
276
Solving problems
14 Consider a helium nucleus where the distance
quanta to
quarks
Solve problems and analyse information to
calculate the mass defect and energy
released in natural transmutation and fission
reactions.
9
8
7
16O
4He
56Fe
208Pb
12C
235U
9Be
6Li
5
4
2.9 MeV/nucleon
3 3H
3He
2
1.15 MeV/nucleon
2H
1 1H
50
100
150
200
250
Figure 14.11.4
? + 3 01n
1
65
66
0n + 29Cu 29?
22
?
11Na 10?
+ +01e +
Q uesti o
Re
277
14
20th century
alchemists
PHYSICS FOCUS
Probing Atomic Structure
In 1934 Ernest Rutherfords prediction of neutrons
was proved by his associate James Chadwick, work
that gained Chadwick the Nobel Prize in Physics
in 1935. More recently, Bertram Brockhouse and
Clifford Shull shared the Nobel Prize for Physics in
1994 for pioneering contributions to the development
of neutron-scattering techniques for studies of
condensed matter. This is the modern application of
neutron scatteringdetermining the arrangement
and motion (structure and dynamics) of atoms and
molecules, both of which can determine the physical
properties of the material they form.
Today, these methods are practised at neutronscattering centres such as the Bragg Institute at the
Australian Nuclear Science and Technology
Organisation (ANSTO) located at Lucas Heights,
south of Sydney. Neutrons are used to study atomic
positions, motions and magnetic properties of
materials. Neutron scattering provides unique
information about a material that cannot be obtained
by other methods.
Extension
4 Investigate and report on the ways in which you
could detect scattered neutrons.
5 Neutron scattering is very sensitive to different
isotopes of an element (atoms that have nuclei
with the same number of protons but different
numbers of neutrons). Outline some reasons why
this would be the case.
Figure 14.11.5
Dr Vanessa Peterson is changing
a sample in a high-temperature
furnace near Echidna, the highresolution neutron powder
diffractometer at ANSTO.
In this experiment, the structure
of the material was determined,
as it changed with temperature.
278
15
Manhattan Project, isotope,
Los Alamos, fusion, fission, fuel rods,
core, moderator, control rods, coolant,
radiation shielding, radioisotopes,
half-life, particle accelerator, cyclotron,
positron, radiopharmaceuticals,
radiotherapy, irradiated, cosmic rays,
muons, synchrotron, linear
accelerators, quarks, hadrons, bosons,
pions, Standard Model, mesons,
baryons, leptons
PRACTICAL
EXPERIENCES
Activity 15.1
279
15
The Manhattan Project left a legacy that shaped the 20th century in ways
that few could have imagined. The project led to the post-World War II nuclear
arms race between the Soviet Union and the West. After the test detonation of
the Soviet Unions first bomb in 1949, relations between the Soviet Union and
the West deteriorated. This tension became known as the Cold War and
continued until the early 1990s; it ended with the dissolution of the Soviet
Union. The United States of America, Britain and the Soviet Union developed
fusion bomb technology, known as the thermonuclear bomb, the H-bomb or
the hydrogen bomb. By the mid-1960s France and China had tested atomic
weapons, and today Israel, India, Pakistan and North Korea also possess atomic
weapons. Despite the Nuclear Non-proliferation Treaty, which pursued a
reduction in weapon stockpiles, today there is a worldwide arsenal of
approximately 20000 nuclear weapons.
In tandem with the military developments, civil projects using nuclear
energy have also proliferated. Today there are 440 nuclear power plants
worldwide, which supply 15% of the worlds electricity demand, and there are
more than 200 research reactors. The use of reactors for nuclear medicine,
neutron scattering and other industry-based applications highlights the peaceful
endeavours pursued by scientists and engineers today.
Checkpoint 15.1
1
2
3
4
280
quanta to
quarks
pressuriser (prevents
water boiling)
containment building
primary coolant
circuit (high
pressure
water)
boiler
pump
water
condenser
turbines and
electricity
generator
to grid
control
rod
Figure 15.2.1
sea water to
condense steam
fuel
rod
Nuclear energy
of fuel
Heat energy
of coolant
Kinetic energy of
steam and turbine
Electric energy
Light, heat,
sound, etc.
Figure 15.2.2
The nuclear reaction resulting from the absorption of a slow (or thermal)
neutron by uranium-235 has many possible pathways for the decay process
to proceed.
Following are three examples:
235
92
U+n
235
92
235
92
236
92
U+n
236
92
U+n
236
92
140
55
93
Cs + 37
Rb + 3n
147
57
90
36
La + 87
Br + 2n
35
Kr + 143
Ba + 3n
56
281
15
Checkpoint 15.2
1
2
Define fission.
Construct a table that lists the six key elements in a nuclear reactor and outline their purpose.
15.3 Radioisotopes
Radioisotopes are increasingly being used in medicine, scientific research and
industry. They provide a unique tool that is often cheaper and more effective
than alternative techniques and processes. Radioisotopes are atoms that have an
unstable ratio of protons to neutrons and will decay via alpha or beta decay to
attain a more stable configuration. Some may also emit gamma radiation. Each
radioisotope has a specific half-lifethe time it takes half of the radioisotope
sample to decay. For example, if a sample contains 12 grams of a radioisotope
with a half-life of 5 hours, in 5 hours time half of the radioisotope will have
decayed and 6 grams will remain. Five hours later there will be 3 grams of the
original radioisotope present in the sample, and so on.
Radioisotopes occur naturally or can be produced by changing the ratio
of nucleons in the nuclei of a target material in a nuclear reactor or a particle
accelerator. A nuclear fission reactor provides a source of neutrons that can be
used to irradiate a target material. A particle accelerator such as a cyclotron can
accelerate protons or ions and fire them into a target material. Both processes are
required to produce the range of radioisotopes used in medical and industrybased applications.
In Australia, the Australian Nuclear Science and Technology Organisation
(ANSTO) facilities in Sydney, produces a range of radioisotopes. ANSTO
operates Australias only nuclear research reactor, OPAL, at Lucas Heights and
the National Medical Cyclotron at The Royal Prince Alfred Hospital. It is
necessary to have both the research reactor and the cyclotron, as each produces
different types of radioisotopes. Pharmaceutical companies, research centres and
hospitals operate other medical and industrial cyclotrons.
Radioactive tracers
Often radioisotopes are used in medical and industrial applications to track the
movement, flow or absorption of materials. The radioisotope can be incorporated
into a molecule or introduced into the system. These isotopes then act as tracers
and allow the position, flow or absorption to be mapped. In complex biological
systems it is possible to map the pathway of specific chemicals being transported
and used by cell tissue. For example, glucose is metabolised by brain tissue and so
the radioisotope carbon-11 can be placed into the glucose molecule to act as a
tracer in medical imaging. The carbon-11 via beta-decay is a positron emitter,
and the radioactive labelling of the glucose molecules can be tracked and imaged
using positron emission tomography (PET). An industrial application is the use
of a radioactive tracer in mapping the dispersion of a sewerage ocean outfall.
282
quanta to
quarks
Medical radioisotopes
Nuclear medicine has been used routinely since the 1970s. Radioisotopes that
decay quickly are said to have short half-lives, and are often used in diagnostic
procedures. Radioisotopes with longer half-lives are often used therapeutically to
target diseased organs and tumours.
Radiopharmaceuticals
Radiopharmaceuticals can be classified as diagnostic or therapeutic. The
overarching process on which this pharmacology is based is the specific uptake
and absorption of chemicals by organs and specific body tissues. This provides an
ideal mechanism that enables radioisotopes to be incorporated into or attached
to molecules, which are then tracked or used to target specific tissues.
Diagnostic radiopharmaceuticals are used to assess the functioning of organs
including the lungs, heart, liver and brain; identify bone fractures not visible in
X-rays; and assess the flow of fluids such as blood. The decay of the diagnostic
pharmaceuticals can be monitored by detectors that may comprise a simple
device such as a Geiger counter or a complex array of detectors that convert the
information into an image. Diagnostic radiopharmaceuticals subject the body to
a very low radiation dose, usually comparable to a routine diagnostic X-ray.
Therapeutic radioisotopes generally contain radioisotopes with longer
half-lives and once absorbed by the specific tissue or organ will deliver a target
dose of radiation.
Radiotherapy
In radiotherapy the radiation emitted from a radioactive source is directed
at an area of diseased tissue. This procedure is referred to as teletherapy when the
source is located outside the body. In brachytherapy a radioactive implant is used
to target the specific tissue.
Industrial radioisotopes
Radioisotopes are used widely in industry across applications as diverse as
checking the structural integrity of bridges, determining wear in engine
components, examining welds in gas lines, imaging internal structures in jet
aircraft engines, and thickness control. Radioisotopes commonly used in
measuring the thickness of materials include iridium-190 and cobalt-60. The
most common (often unrecognised) application of a radioactive isotopes is
smoke detection, and americium-241 is the radioisotope most commonly used
in home-based detector systems.
Figure 15.3.1
283
15
phosphorus-32
Thyroid
technetium-99m
iodine 123
iodine-131
iodine-132
selenium-75
Brain
iodine-131
mercury-197
technetium-99m
oxygen-15
carbon-11
Blood
Heart
iron-59
rubidium-81m
thallium-201
cesium-137
Lungs
xenon-127
xenon-133
nitrogen-13
oxygen-15
carbon-11
indium-113m
technetium-99m
Spleen
chromium-51
rubidium-81
technetium-99m
Kidney
technetium-99m
mercury-197
iodine-131
Liver
technetium-99m
iodine-131
gold-198
Pancreas
selenium-75
Bladder
gold-198
Placenta
iodine-123
iodine-131
carbon-11
Knee
yttrium-90
rhenium-186
Prostate
gold-198
indium-111
Bone
strontium-85
strontium-87
fluorine-18
iron-52
phosphorus-32
technetium-99m
phosphorus-32
arsenic-74
indium-133m
iron-18
Lymph
gold-198
Figure 15.3.2
Agricultural radioisotopes
PRACTICAL
EXPERIENCES
Activity 15.2
Food irradiation
In many countries certain foods are irradiated in order to increase the
shelf life and make some foods safer to eat. The exposure of the food to gamma
radiation targets disease-causing bacteria or those that cause spoilage. A
common, widely used radioisotope is cobalt-60. The gamma rays passing
through the food possess enough energy to destroy many bacteria without
Identify data sources, and gather,
changing the texture or flavour of the food. The food never comes into contact
process, and analyse information
with the radioactive source or other forms of radiation and therefore is not at risk
to describe the use of:
of itself becoming radioactive. Examples of irradiated foods found worldwide
a named isotope in medicine
include potatoes and onions in which sprouting is reduced, grains, meats
a named isotope in agriculture
a named isotope in engineering. including poultry and some fish, many spices and dried herbs, and fresh fruits.
284
quanta to
quarks
PHYSICS FEATURE
Hydrogen as an
energy carrier
Figure 15.3.3
Checkpoint 15.3
1
2
3
4
5
6
7
8
9
15
Electrostatic accelerators
There are two common electrostatic accelerators, which were developed in the
1930s: the CockroftWatson accelerator and the Van de Graaff accelerator. You
maybe familiar with the school version Van de Graaff generator (200kV). The
early accelerators were scaled-up versions capable of accelerating particles across
potentials of 1.5 MV. The Science Museum in Boston, Massachusetts, now
houses the (physically) largest air-insulated Van de Graaff ever built. At the top it
has two joined domes, one housing the top of the belt mechanism and the other
housing the top of the original accelerator tube. The museum now uses the
generator to produce bolts of artificial lightning. Modern Van de Graaff
accelerators are smaller and use a tandem system that can accelerate ions across
a potential of 20MV.
Linear accelerators
Linear accelerators comprise a very long straight evacuated tube that contains
a set of hollow metal drift cylinders (Figure 15.4.1). Each alternate cylinder is
electrically connected to an alternating power supply, which provides an
alternating potential difference. An ion is injected into the tube and is initially
attracted toward the first cylinder. Once the ion is in the cylinder, the electric
field drops to zero and the potential polarity is reversed, so that when the ion
emerges from the cylinder it is repelled from the first cylinder and attracted to
the second. When the ion enters the second cylinder, it enters a region of zero
electric field and drifts at a constant velocity through the cylinder. The polarity
again reverses and when the ion emerges it is repelled by the second cylinder and
286
quanta to
quarks
attracted to the third. This process is repeated for the entire journey of the ion
down the tube. The alternating potential remains at a constant frequency and,
therefore, as the ions increase in velocity, the lengths of the hollow metal drift
cylinders are increased. The Stanford Linear Accelerator Center (SLAC) houses the
most well-known and longest linear accelerator (Figure 15.4.2). It is 3.2 km long.
Cyclotron
In 1932 Ernest Lawrence (19011958) and Stanley Livingston
(19051986) developed a small, compact accelerator that
electrostatically accelerated ions or elementary particles across
a small gap between two semicircular D-shaped hollow metal
cavities called dees (see Figure 15.4.3). Once accelerated
across this gap, the particle is turned through 180 in a near
semicircular path by a constant uniform strong magnetic field.
The particle is again accelerated across the gap and again
turned through 180. The two dees are connected to a
constant high-frequency alternating power source, which
provides the electric field that accelerates the particle across
the gap. The polarity across the gap is reversed as the particles
are being turned. The electric field inside the semicircular
hollow metal dee is zero, and therefore does not affect the
path of the particle. As the velocity of the particle increases,
the radius of its orbit also increases, resulting in an outward
spiral trajectory. One of the dees has an exit point and a beam
of particles is produced. A design limit is imposed upon the
cyclotron because, if you attempt to accelerate particles
beyond energies of 20MeV, the increase in relativistic mass
causes the particles to become out of phase with the constant
high-frequency source. At the ANSTO-operated National
Medical Cyclotron at The Royal Prince Alfred Hospital,
Sydney, the particle beam is used to produce radioisotopes.
source
Figure 15.4.1
electron injector
damping rings
positron
electron
D1
+
bending magnets
focusing
magnets
collision
detector
E
+
D2
high frequency
alternating
voltage
D1
3 km
positron source
Figure 15.4.2
a
D2
ion or electron
source
accelerated
particle beam
Figure 15.4.3
(a) Schematic diagram of a cyclotron. (b) Particles are accelerated across the
gap by an electric field, so that each semicircular spiral has a larger radius.
(c) By the time the particle reaches the gap again, the voltage has reversed and
the particle is accelerated. These diagrams show the acceleration of positive ions.
287
15
Van de Graaff
accelerator
deflecting
magnets
Synchrotron
target
high frequency
alternating
voltage
injection
point
deflecting
magnets
Figure 15.4.4
+ proton source
antiproton
source
Figure 15.4.5
main ring
magnets are
coaxial with
rings
Tevatron ring
collision
detector
Switzerland
Geneve
LHCB
Point 8
France
CERN
Point 1ATLAS
ALICE Point 2
Point 5 CMS
TI 8
LHC-B
SPS
TI 2
ATLAS
ALICE
CMS
Figure 15.4.6
288
/LH
LEP
Figure 15.4.7
quanta to
quarks
tracking
device
particle
electromagnetic hadronic identification
detector
calorimeter calorimeter
photons
muon chamber
hadron calorimeter
E-M calorimeter
tracking
electrons or
positrons
muons
e+
pions or
protons
neutrons
Figure 15.4.8
Figure 15.4.9
Schematic design
for a typical
particle detector
Particle detectors
Particle detectors are designed to record and provide a way of seeing the
explosion of particles that can be generated as a result of the collision. Modern
particle detectors are made up of a series of layers of sub-detectors, each
specialising in identifying particular types of particles or specific properties (see
Figure 15.4.8). These sub-detectors can be classified into three main types:
Tracking devices detect and reveal the trajectory of a particle. Often a
magnetic field is used to bend the trajectory of charged particles into a curve.
The curvature provides physicists with information about the momentum of
the particle. Early detectors such as the cloud chamber and the bubble
chamber provided a direct visual record of the paths of particles. Modern
tracking devices produce electrical signals that are processed by computers to
reconstruct the path of particles. The outer layer of many detectors contain a
muon chamber that detect these weakly interacting particles, which can travel
through many metres of dense material.
Calorimeters used in detectors are of two types: the electromagnetic
calorimeter, which is positioned close to the collision, and the hadronic
calorimeter. These sub-detectors are designed to stop, absorb and measure the
energy of a particle. The electromagnetic calorimeter measures the energy of
lighter particles such as electrons and photons. Hadronic calorimeters absorb
the energy of heavier particles, which contain quarks (these types of particles
are called hadrons, and include pions, protons and neutrons). The
interactions inside both types of calorimeters often cause successive cascades
or showers of new particles and with each interaction or collision the
momentum and energy is shared between more particles, until eventually all
are slowed and stopped. To accurately measure the total energy requires the
detector to stop nearly all particles, and thus explains the huge overall size of
the detectors used. Calorimeters can stop most known particles except muons
and neutrinos.
Particle identification detectors identify the type of particle by various
techniques. Two commonly used methods to detect the nature of charged
particles are Cherenkov radiation, produced when charged particles travel
faster than the normal speed of light of that medium, and transition
radiation, produced when a charged particle crosses the boundary between
certain materials.
289
15
PHYSICS FEATURE
The worlds largest
microscope
290
quanta to
quarks
Figure 15.4.11 ATLAS detector schematic with a simulated detection of a Higgs boson
291
15
Checkpoint 15.4
1
2
3
4
292
In the early 1930s the list of identified sub-nuclear particles comprised the
electron, the proton and the neutron; the positron and the neutrino had
been proposed by Pauli. It was not long before the muon and the positron
were identified in cloud chambers while examining cosmic showers, and by
the 1960s a hundred other particles had been identified. The physicists
were confronted with this ever-increasing zoo of particles, as the energy of
the collisions increased. They had no overall unifying theory to explain the
behaviour or adequately classify these particles. The quest to uncover
which of these particles were truly fundamental became a primary area
of research.
A few of the major discoveries that led to the development of the
Standard Model of matter are discussed here. In 1964 Murray Gell-Mann
(1929) and George Zweig (1937) proposed the existence of quarks to
explain the properties of a family of particles called hadrons. The hadrons
are further subdivided into two types: mesons, those containing two
quarks (comprising a quark and antiquark pair), and baryons those
containing three quarks. In 1967 the electroweak theory, unifying the
electromagnetic and weak nuclear force, was proposed by the collaboration
of Steven Weinberg (1933) and Sheldon Glashow (1932), and
independently by Abdus Salam (19261996). In 1969 the first evidence of
quarks was reported by Jerome Friedman (1930), Richard Taylor (1929)
and Henry Kendall (19261999). Then, during the period of 19701973,
the Standard Model was formulated. All the components of the model
except the Higgs boson were verified over the next 25 year period.
Today the Standard Model of Matter provides a mechanism to classify
and explain the nature of matter. It includes three of the four fundamental
The model has three main
forces but excludes gravity (Figure 15.5.1).
components: quarks, leptons and force carriers. The experiments at
Fermilab and the new CERN LHC to search for the Higgs boson will either
confirm the Standard Models validity or send the experimentalists and
theorists back to formulate a new or revised model. Either way, the major
challenge facing particle physicists is the need to develop a grand unified
theory that will unite all four of the fundamental forces.
quanta to
quarks
Quarks
There are six varieties (commonly called flavours) of quarks and in all there
are 12 types of quarks. The first generation of quarks are named up and down,
the second generation are charm and strange, and the third are top and bottom.
Each quark is considered to be a fundamental point-like particle that carries the
properties of mass, charge and colour. The electric charge of a quark is fractional
and is either 1/3e or +2/3e, and they possess a colour charge that can be either
red, blue or green. Quarks also have antimatter counterparts that possess the
same magnitude of electric charge with the opposite sign, and colours that are
antired, antiblue and antigreen. Quarks with different colour charge attract each
other, as do quarkantiquark pairs of the same colouranticolour (e.g. red and
antired). The colour charge is associated with the strong force and is constantly
changing as they interact with gluons.
Quarks do not exist as bare entities, and only form composite particles called
hadrons. Within this classification, quarks can either pair with an antiquark to
form a meson, or three quarks can combine to form a baryon. All hadrons have
a total electric charge of 1e, and the total colour charge will sum to white
(achieved by the quarks in mesons having a colour and an anticolour, or in
baryons by the combination of all the quarks having different colours).
atom
neutron
electron
quarks
matter
proton
nucleus
Matter
particles
These
particles
transmit the
four
fundamental
forces
of nature
although
gravitons
have so far
not been
discovered
1st FAMILY
Force
particles
QUARKS
Electron neutrino
Particle with no electric charge,
and possibly no mass; billions
fly through your body every second
Up
Has an electric charge of
plus two-thirds; protons contain two,
neutrons contain one
Down
Has an electric charge of minus
one-third; protons contain one,
neutrons contain two
2nd FAMILY
These
particles
existed just
after the
Big Bang.
Now they are
found only
in cosmic
rays and
accelerators
LEPTONS
Electron
Responsible for electricity
and chemical reactions:
it has a charge of 1
Muon
A heavier relative of the
electron; it lives for twomillionths of a second
Muon neutrino
Created along with muons
when some particles decay
Charm
A heavier relative of the up;
found in 1974
Strange
A heavier relative of the down;
found in 1964
Tau
Heavier still; it is extremely
unstable. It was discovered
in 1975
Tau neutrino
Recently discovered
Top
Heavier still
Bottom
Heavier still; measuring
bottom quarks is an important
test of electroweak theory
Photons
Particles that
make up light;
they carry the
electromagnetic
force
Intermediate
vector bosons
Carriers of the
weak force
3rd FAMILY
All ordinary
particles
belong to
this group
Gluons
Carriers of the
strong force
between quarks
Felt by:
quarks
The explosive release of nuclear energy
is the result of the strong force
Figure 15.5.1
W
Felt by:
quarks and charged
leptons
Z0
Gravitons
Carriers of
gravity
W+
Felt by:
quarks and leptons
Felt by:
all particles with mass
All the weight we experience is the
result of gravitational force
A tabular presentation of the three main components of the Standard Model and the three generations of particles classified
as quarks, as well as leptons and a description of the gauge bosons that mediate the four forces of nature
293
15
Charge
Colour charge
Up
+2/3e
Down
1/3e
Charm
+2/3e
Strange
1/3e
Top
+2/3e
Bottom
1/3e
Hadron
Example
Meson
(2 quarks)
Positive pion
Neutral kaon
Proton
Neutron
Sigma plus
Baryon
(3 quarks)
Symbol
K
p
n
+
Constituent
quarks
ud
ds
uud
udd
uus
No passing
proton
uud
neutron
udd
Leptons
There are six varieties and 12 types of leptons. The most commonly known is the
electron. All leptons appear to be fundamental point-like particles. The electron,
the heavier muon and the massive tau all have an electric charge of 1e and
their antimatter counterparts have a charge of +1e. Each of these leptons has
an associated neutrino that is named accordingly; for example, the electron (e)
has an electron neutrino (e).
Checkpoint 15.5
1
2
294
Draw a chart that shows the classes and subclasses of particles, starting with hadrons.
List the properties of quarks, hadrons and leptons.
PRACTICAL EXPERIENCES
quanta to
quarks
CHAPTER 15
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
295
15
296
Chapter summary
Review questions
quanta to
quarks
Physically speaking
Very quarky
1
3
6
7
8
9
10
11
12
13
14
15
16
17
18
19
Across
1 Discovered the neutron
Down
2 Type of accelerator
10 Atom smasher
5 A project
8 An electron is one
17 A type of scattering
11 Lived 18711937
19 Type of reactor
13 A constant R
14 They are the reason for the strong force
16 Fermis first name
297
15
Reviewing
Re
iew
298
Q uesti o
12
13
14
15
Photon
Strong force
Graviton
20 Explain why the top quark was the last of the six
quarks to be found.
quanta to
quarks
PHYSICS FOCUS
Linking the very big with
the very small
The very big and the very small are inextricably linked
as cosmology and particle physics enter a new era of
research. Many of the recent developments in
cosmology have depended on advances in high-energy
physics.
1 Explain why it is necessary to accelerate particles
to extremely high velocities.
2 Describe how high-energy accelerators such as the
Large Hadron Collider can test aspects of theories
such as the Big Bang.
Extension
3 Research how high-energy physics was and is still
carried out using cosmic rays.
299
Multiple choice
(1 mark each)
1 Which of the following lists only contains people who
provided science with atomic models in the period
up until 1913?
A Bohr, Democritus, Heisenberg, Schrodinger
B Aristotle, Bohr, Pauli, Rutherford
C Bohr, Dalton, Rutherford, Thomson
D Bohr, Einstein, Rutherford, Schrodinger
300
quanta to
quarks
Short response
1
Quark
Up
Charge
Mass
GeV/c2
Down
Strange
Charm
Bottom
KE
Top
2
+ e
3
1
e
3
1
e
3
2
+ e
3
1
e
3
2
+ e
3
0.005
0.01
0.2
1.5
4.7
180
Extended response
6
301
5
Context
Figure 16.0.1
medical
physics
The human body is a complex machine and many things can go wrong with it.
However, unlike a machine, we cannot do a total shutdown to locate and correct a
problem. Most of the functions of our body need to continue working when we are
looking for a problem and when we are trying to fix it. This is where physics comes
to the aid of modern medicine by providing the tools to investigate and, in some
cases, repair problems in the living body.
We see using light, and optical-fibre technology offers a minimally invasive way
of looking inside the body to actually see what may have gone wrong. Listening is also
an important way for doctors to find evidence of problems but, remarkably, we can
now use sound to look deep within the body without ever breaching the surface.
Although they present some potential damage to human tissues, X-rays can also
give us vital information of structural problems below the surface. For more than a
century, X-rays offered the only non-invasive way to see inside the body. Now, when
combined with enhanced computer technologies, X-rays provide powerful tools for
creating images of the structures within the body.
Modern physics also offers magnetic resonance imaging (MRI) as a powerful
method to investigate both the form and function of individual tissues. Carefully chosen
radioisotopes offer similar possibilities, because they can be used
to identify where atoms are accumulating within our
bodies. Other radioisotopes provide a therapeutic
tool to help combat certain problems.
None of these techniques will reveal
everything, but physics has provided
doctors with a range of diagnostic
and therapeutic tools that earlier
generations could never have
imagined.
302
3% Loss
5% Loss
Figure 16.0.2
Inquiry Activity
What makes you tick?
The ancient Egyptians knew that the heart was centre of blood supply in the
human body. Its pretty obvious when you look inside the body. However, just how it
worked remained a mystery, because looking inside dead bodies was not permitted
in many societies. Great advances were made by Andreas Versalius (15141564)
in the 16th century precisely because dissection was permitted in Italy in some
circumstances. In 1543 he published De Humani Corporis Fabrica (On the Fabric
of the Human Body), which marks the foundation of the scientific study of
human anatomy.
The function of many of the organs of the human body has been poorly
understood until recentlynone more so than the brain. However, modern imaging
techniques such as PET and MRI are slowly revealing the operation of the brain
by uncovering what parts are involved in different actions or affected by
different diseases.
In this exercise you should seek out reliable sources of information on the web
to investigate the difference between modern structural and functional imaging
of the brain.
1 Define what is meant by the structural and functional imaging.
2 How would you classify the images in Figures 16.0.1 and 16.0.2are they
structural or functional?
3 What imaging techniques allow structural differences to be seen in the brain?
4 What diseases might be revealed by structural images of the brain?
5 What imaging techniques allow functional differences to be seen in the brain?
6 Outline one study of the brain that might be conducted using functional images.
7 How reliable are your sources? Why do you expect them to be reliable?
303
16
Imaging with
ultrasound
See like a bat
Bats, dolphins and some birds have evolved the ability to produce
and sense sound waves that are reflected from objects in their
surroundings, enabling them to hunt prey and to navigate in the dark.
The principle used by these animals has been applied to radar
(using radio waves) and sonar (using sound waves).
Since the 1960s, the principles of sonar have also been applied
for medical purposes. Ultrasound imaging is a medical procedure
used to produce pictures of the inside of the body by using sound
waves with a frequency much higher than that audible to humans.
compression
Air pressure
high
Distance
low
Figure 16.1.1
304
atmospheric
pressure
rarefaction
medical
physics
ultrasound waves with frequencies as high as 125 kHz for navigating and
sonar visualisation.
Ultrasound used for medical purposes has frequencies that range
from 500kHz to 30MHz, but for most imaging applications, the
ultrasound used is in the range from 3.5 MHz to about 10 MHz. The
exception is ultrasound used for imaging blood vessels in the body. Tiny
probes on the end of a catheter (tube) are inserted into the blood vessels and
operate at frequencies up to 30MHz.
The choice of ultrasound frequency that is most suitable for
producing a medical image is a compromise between conflicting criteria.
Higher frequency ultrasound produces images with a better resolution of
detail, but they have poorer penetration through tissues. Imaging depth into
tissue is limited by attenuation (absorption) of the ultrasound waves, and
this increases as the frequency is increased.
Checkpoint 16.1
1
2
3
Material
Air
Fat
Water
Brain
Liver
Kidney
Blood
Muscle
Lens of eye
Skull bone
Average soft tissue
330
1450
1480
1540
1550
1560
1570
1580
1620
4080
1540
PRACTICAL
EXPERIENCES
Activity 16.1
16
Seeing with
ultrasound
Figure 16.2.2
306
Figure 16.2.3
medical
physics
PHYSICS FEATURE
Obstetrics
Figure 16.2.4
Ultrasound of a foetus
307
16
Seeing with
ultrasound
Disadvantages
Many reflections occur within the body and therefore good imaging
is operator dependent.
An ultrasound image is harder to interpret than an X-ray image.
Checkpoint 16.2
1
2
3
4
force
+ voltage
voltage
force
Figure 16.3.1
b
PZT rods
epoxy resin
matrix
Figure 16.3.2
308
(a) PZT rods are embedded into plastic to construct transducers of various
shapes, such as (b) a convex array transducer.
medical
physics
(convex), resulting in a divergent beam (Figure 16.3.3). This is different from
earlier linear array transducers, which had parallel beams.
Convex array transducers use as many as 512 piezoelectric elements to display
a wedge or pie-shaped image. The size of the area of skin through which the
ultrasound enters the body (the acoustic window) depends on the curvature of
the transducer surface. There is generally a narrow window at the skin surface,
while still providing a wide angle deeper beneath the skin (Figure 16.3.4). The
narrow field of view close to the transducer makes structures near the transducer
difficult to view.
The smallest size of objects that can be imaged (resolution) using
ultrasound depends on the wavelength (or frequency). All waves have the property
of being diffracted by objects; that is, their paths are bent around the object.
After passing around the object, the paths of the waves recombine and undergo
interference (see in2 Physics @ Preliminary section 7.4), which tends to smudge
the reflection of the object.
If the size of the object is approximately one wavelength or less, the smudging
becomes so significant that the individual object is not visible, limiting the ability
to resolve fine detail. This diffraction limit applies to all waves. In ultrasound,
this limits resolution both in the direction along the beam (axial) and across the
beam (lateral) and is summarised in the Table 16.3.1.
Low frequency waves are more effective in penetrating human tissue. This
results in a dilemma for ultrasound imaging. In order to produce a clear image
of an object deep in the body, low frequency waves should be chosen over high
frequency waves. However, to obtain a clearer image of smaller structures,
a high frequency needs to be used. Clearly the final choice is a compromise.
data processing
computer
convex
array
organs
sector scan
Figure 16.3.3
monitor
Figure 16.3.4
MHz
3.0
4.0
5.0
7.5
10.0
Axial
resolution
1.1
0.8
0.6
0.4
0.3
mm
mm
mm
mm
mm
lateral
resolution
2.8
1.5
1.2
1.0
1.0
mm
mm
mm
mm
mm
Checkpoint 16.3
1
2
3
4
16
Seeing with
ultrasound
The ability of a medium to transmit sound varies between materials. Even for a
The acoustic
single material, it also varies with the frequency of the sound.
impedance describes how readily a specific sound frequency will pass through
a material. It is defined as the product of the density (in units of kg m3) of the
material and the acoustic velocity u (in m s1) in the material:
Z therefore has units of kg m
Z =
s , which is called a rayl.
2 1
Worked example
Question
Bone has a density of 2 103 kg m3. The speed of sound in bone is 4080 m s1.
Calculate the acoustic impedance of bone.
= 2 103 kg m3
= 4080 m s1
Solution
Z = = 2 103 kg m3 4080 m s1
Z = 8.16 106 rayl
Ir Z 2 Z1
=
Io Z + Z 2
2
1
Identify that the greater the
difference in acoustic
impedance between two
materials the greater the
reflected proportion of the
incident pulse.
310
It = IO Ir
Z2 = 22
Figure 16.4.1
Reflection occurs at a
boundary between two
materials with different
acoustic impedance.
medical
physics
Table 16.4.1 Acoustic impedance of ultrasound in human tissue
Density
(kg m3)
Ultrasound velocity
(m s1)
1000
1540
1.54 106
920
1450
1.33 106
Lens of eye
1100
1620
1.78 106
1040
1541
1.60 106
1040
1541
1.60 106
Muscle
1040
1585
1.65 106
Kidney
1040
1561
1.62 106
Blood
1060
1570
1.66 106
Liver
1065
1549
1.65 106
1810
4080
7.38 106
Skull bone
1910
4080
7.79 106
Substance
Water
Fat
Impedance
(rayl)
Worked examples
Question
Calculate the percentage of ultrasound intensity reflected at a boundary between fat and muscle.
Fat:
Muscle:
Solution
I r Z 2 Z 1
[(1.648 106 ) (1.334 106 )]2
=
= 0.011
=
Io Z + Z 2 [(1.648 106 ) + (1.334 106 )]2
2 1
Hence only 0.011 of the incident intensity is reflected. This is 1.1% of the incoming intensity.
Ir Z 2 Z1
=
Io Z + Z 2
2
1
Question
Calculate the percentage of ultrasound intensity reflected at a boundary between air and muscle.
Air:
Muscle:
Solution
(
(
) (
) (
6
6
2
I r Z 2 Z 1 1.648 10 0.0004 10
=
=
I o Z + Z 2 1.648 10 6 + 0.0004 10 6
2 1
) 2 = 0.999
) 2
Hence 99.9% of the incoming intensity is reflected. Since intensity is proportional to energy,
we can say 99.9% of the energy is reflected.
Bone has a much higher acoustic impedance than other tissues, so most of the
ultrasound energy that encounters bone is reflected. For this reason, ultrasound
cannot image the internal structure of bone or a brain inside an adult skull. Some
ultrasound penetrates the skull of a foetus, because the bone has not yet calcified
it is cartilage, similar to the tissue that gives ears their semi-rigid nature.
If ultrasound travelling from air or other gas strikes a soft-tissue boundary of
any type, then most of the ultrasound energy is reflected. For this reason, when
an ultrasound is being carried out, a gel is placed on the skin and the ultrasound
probe is moved over this gel. The most important function of the gel is to exclude
any air from the region between the transducer and the skin. The gel has an acoustic
311
16
Seeing with
ultrasound
impedance similar to that of human tissue. The term used to describe this
Impedance
process is impedance matching or acoustic coupling.
matching between two media results in most of the energy being transmitted
through the interface, with almost no reflection.
A further complication arises at the boundary between two different tissues,
because ultrasound is also refracted at a boundary (see section 18.1 and in2
Physics @ Preliminary section 8.3). This bending of the sound path can
complicate the analysis of the data.
Checkpoint 16.4
1
2
Describe the conditions necessary for reflection of ultrasound waves to occur at a boundary.
Calculate the percentage of ultrasound energy that reflects from a muscle and bone boundary. (Use values in
Table 16.4.1 to help.)
Explain why it is necessary to use gel on the transducer in order to get clear images.
A-mode scans
1200
high-frequency signal
absolute value of signal
Hibert envelope of signal
Amplitude
800
400
0
400
800
1200
Figure 16.5.1
312
medical
physics
foetal
skull
probe
spine
probe
organ
to scan
display
vertebra
pulse
echo
transducer
A scan
display
Echo strength
B-mode scans
Time
B scan
display
Figure 16.5.2
ultrasound image
synthesised
from scans
Figure 16.5.3
M-mode scans
A single-ultrasound beam can be used to
produce an M-mode image in which movement
of an individual structure such as a heart valve
can be displayed versus time. An M-mode scan
uses a high-sampling frequency (up to 1000
pulses per second) to capture the rapid motion.
This display is essentially a series of B-mode
scans lined up side-by-side in time (Figure
16.5.4).
2D real-time scans
Using a convex-array transducer with many
piezoelectric elements, beams of ultrasound can
be launched in different directions into the
body. Each beam produces a B-mode scan.
Once all the echoes from the first beam have
been picked up, the transducer sends a second
Figure 16.5.4 An M-mode scan through the left ventricle of the heart allows
pulse along a slightly different beam direction
measurement of the ventricle size (green markers) as the heart beats.
into the tissue. Echoes are again picked up and
sent for processing to produce an image, and
another pulse is launched in another direction, and so on. Like the beam of
a searchlight swept across the night sky, the pulsed beam from an ultrasound
transducer is swept through the body, mapping out reflecting surfaces and forming
a 2D real-time scan. The echos image is two-dimensional and shows detail along
313
16
Seeing with
ultrasound
one slice through the body, originating at the transducer. Beams are swept and
ultrasound images are formed very rapidly in real-time images. As the operator
holds the transducer in contact with the skin, the image appears live on a video
monitor. Real-time images are usually made with a sector scanner, so the images
will be pie shaped.
The B-mode scan is continuously updated and displayed in real time, with
the image changing as the tissues move in the field of the ultrasound beam.
The movement of the object being scanned can be monitored on the image.
Interpreting ultrasound images requires considerable skill and knowledge,
as the images in Figures 16.5.5 and 16.5.6 demonstrate. An untrained observer
would have difficulty making sense of the image without explanation.
Figure 16.5.5
Figure 16.5.6
Figure 16.5.7
Checkpoint 16.5
1
2
Figure 16.5.8
314
Example of a 3D ultrasound
image of a foetus
medical
physics
Figure 16.6.1
PRACTICAL
EXPERIENCES
Activity 16.2
315
16
Seeing with
ultrasound
The Doppler shift effect has been used for a long time in foetal heart rate
detectors. Further developments in Doppler ultrasound technology in recent
years have resulted in its increased use in obstetrics to assess and monitor the
well-being of a foetus.
PRACTICAL
EXPERIENCES
Activity 16.3
skin
beam direction
Doppler again
and again
Figure 16.6.3
vessel
flow
B
Sonogram
Figure 16.6.2
Echocardiography
Echocardiography is the name given to ultrasound imaging of the heart. It is
used to investigate heart-valve function, ventricular function, congenital heart
diseases (including holes in the septum separating the left and right halves of
the heart), cardiac tumours and obstructions in cardiac blood vessels.
Pulsed ultrasound from a transducer is used and a continuous
recording is made of the echoes received from the various parts of the heart,
showing the motion of the various parts. A standard transthoracic
echocardiogram can be conducted through the thorax (chest wall). Higher
resolution imaging can be achieved by a trans-oesophageal echocardiogram in
which a transducer on a specialised probe is passed down the patients
oesophagus, thereby getting much closer to the heart.
Blood-flow characteristics in the foetal blood vessels can also be examined
using Doppler ultrasound. Diminished flow, particularly in the diastolic
phase of a pulse cycle (as the heart muscle relaxes) is associated with problems
with the circulation of blood in the foetus. Colour Doppler can also be used
to assess the success of coronary bypass surgery, and is particularly useful in
the diagnosis and assessment of congenital heart abnormalities such as atrial
or ventricular septal defects (a hole between the atria or the ventricles; see
Figure 16.6.3).
Checkpoint 16.6
1 Discuss why the heel is an ideal place to use ultrasound to determine bone density.
2 Explain how the Doppler effect can help diagnose the well-being of a foetal heart.
3 Outline how echocardiography works.
316
PRACTICAL EXPERIENCES
medical
physics
CHAPTER 16
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
317
16
Seeing with
ultrasound
Chapter summary
Review questions
Physically Speaking
Below is a list of topics that have been discussed throughout this chapter. Create a visual summary of the
concepts in this chapter by constructing a mind map linking the terms. Add diagrams where useful.
318
Ultrasound
Transducer
Doppler effect
Frequency
Piezoelectric
Acoustic impedance
Reflection
Linear array
Impedance matching
Bone-density measurement
Echocardiography
A-mode scan
B-mode scan
Sector scan
Blood flow
medical
physics
Reviewing
Solving Problems
22 Bone has a density of 2103 kg m3. The speed of
24
I r Z 2 Z 1
=
Io Z + Z 2
2
1
Re
iew
Q uesti o
319
17
X-rays, cathode, radiographs,
tomography, computed axial
tomography (CAT), characteristicX-rays,
Bremsstrahlung, anode, radiographer,
filter, collimator, grid, fluorescent,
contrastagents
Imaging
with X-rays
The invisible reveals the hidden
Wilhelm Rntgen (18451923) was studying cathode rays in a Crookes
tube wrapped in cardboard, when he noticed a faint green glow from
the fluorescent screen painted with barium platinocyanidethe rays
emitted from the tube were passing through the cardboard. He found
the rays could also pass through other papers and books. This
phenomenon intrigued Rntgen and he dedicated the next few months
to systematically investigating the properties of these rays. He named
this new type of radiation X-rays.
On 22 December 1895, Rntgen took the first X-ray, an image of the
hand of his wife Anna. After this, many scientists and companies began
investigating X-rays. Thomas Edison began a rigorous investigation into
fluorescent materials. He developed the fluoroscope using calcium
tungstate, which became the standard for medical X-ray examinations.
Today, with the development of semiconductor detectors and imageprocessing software, X-rays are used in computed axial tomography
(CAT scans) to provide medical doctors with high-resolution 3D images
of skeletal, vascular and soft tissue.
320
medical
physics
A range of film-based and fluoroscope devices were developed and refined,
providing the familiar two-dimensional X-ray radiographs. In 1930, Alessandro
Vallebona (18991987) proposed the idea of tomography in which a single
slice of the body could be recorded on radiographic film. In the 1970s the
development of affordable computers capable of processing image data, and
the parallel development of semiconductors and image intensifiers allowed a new
generation of X-ray devices to be developed. The technology was now available
to re-engineer Vallebonas tomography. Allan McLeod Cormack (19241998)
developed the theory and Godfrey Hounsfield (19192004) independently built
a prototype of the new generation of X-ray tomography scanner. This became
known as computed tomography (CT) or computed axial tomography (also
called CAT).
Today, with the development of more sensitive detectors and advances in
image processing software, CAT scans can provide two- and three-dimensional
images with resolutions of 1cm. The combination of traditional radiographs and
CAT scans provides the medical profession with important therapeutic and
imaging diagnostic tools.
Checkpoint 17.1
1
2
glass envelope
bearings
rotor/anode
support
cathode
block
rotor
filament
focusing
cup
electrons
target
exit window
Figure 17.2.1
Schematic representation of
a rotating-anode diagnostic
X-ray tube
321
17
Imaging
with X-rays
The tube is highly evacuated and generally constructed from glass. It contains
an anode and a cathode. A potential difference of between 20000 and 300000volts
is applied between the cathode and the anode. The cathode has a small filament
(similar to an incandescent lamp filament), which is heated to a dull red glow by
a small current. The hot filament readily releases electrons and the high potential
difference then accelerates these electrons from the cathode towards the target
anode. The cup shape of the cathode behind the filament focuses the electron
beam towards a point on the target anode. When the electrons hit the angled
tungsten anode, X-rays are emitted through the exit window at the side of the
X-ray tube. Because of the inefficiency of the process, a huge amount of thermal
energy is produced. To ensure the tungsten anode does not melt, it is often rotated
to more evenly distribute the energy. The anode is often mounted on a copper
heat-sink that may also have an additional cooling system attached. Shielding, to
reduce the emission of X-rays in unwanted directions, surrounds the tube itself.
Checkpoint 17.2
1
2
3
X-rays are often categorised as hard and soft X-rays, descriptive terms
that indicate the relative penetrating ability of the X-ray beam. Soft X-rays do
not penetrate through body tissue and are absorbed easily. These absorbed X-rays
are not useful for producing radiograph images and they pose an increased risk
to patients, of genetic mutations and cancers. X-ray machines have aluminium
filters that attenuate these lower energy soft X-rays from the X-ray beam. Hard
X-rays have a higher energy and greater penetration into and through tissue, thus
producing a sharper image.
Changing the potential difference between the cathode and anode can vary
the distribution of hard and soft X-rays produced by the X-ray tube. Increasing
the voltage to the tube will produce more hard X-rays. The radiographer can
adjust the voltage, to obtain the best quality image of the particular organ, tissue,
vessel or bone under examination.
Table 17.3.1 Properties of X-rays
322
Hard X-rays
Soft X-rays
medical
physics
Characteristic X-rays
ejected
ei
2p
1s
2s
e
electron
e
2p
1s
2s
characteristic X-ray
tic
is
ter
c
ara
ch
ay
X-r
2 posible paths
Au
to fill the gap
ger
ele
ctr
o
n
e
2p
e
1s
2s
0.02 0.04
0.12
Wavelength (nm)
Figure 17.3.1
X-ray wavelength
10
Relative intensity
Figure 17.3.2
X-rays from a
molybdenum
target at 35 kV
a gap remains
characteristic
X-rays
Brehmsstrahlung
continuum
X-ray intensity
incident
electron
3
Relative intensity
The accelerated electrons can interact with and knock out an electron from
the inner electron shell of the anode. If an outer shell electron drops down and
fills the vacant position, an X-ray photon of a specific energy and wavelength,
peculiar to the elements of the target will be produced. In the total X-ray
spectrum, the characteristic X-rays appear as sharp well-defined peaks.
Figure 17.3.1 shows the characteristic peaks for X-rays produced by a
molybdenum target anode bombarded by electrons accelerated through
a potential of 35kV. The details of the process are shown in Figure 17.3.2.
Bremsstrahlung
The word Bremsstrahlung means braking radiation in German. It is used
to describe the radiation emitted when electrons are decelerated or braked, when
fired into a metal target. When an electron rapidly changes its velocity, the lost
kinetic energy is converted into photons of electromagnetic radiation. When the
energy of the incident electrons is high enough, the radiation emitted will be
X-rays. Decelerating electrons from the beam interact and produce a continuous
distribution of radiation.
Figure 17.3.3 shows a set of curves for a tungsten target with electron beams
of four different energies. The details of the Bremsstrahlung process are shown
in Figure 17.3.4.
40 kV
6
30 kV
2
20 kV
0.02
0.04
0.06
0.08
0.10
Wavelength (nm)
Figure 17.3.3
ei incident electron
incident electron
ei
hard X-ray
ei
Figure 17.3.4
X-ray intensity
soft X-ray
Wavelength (nm)
ei
17
Imaging
with X-rays
Checkpoint 17.3
1
2
3
4
Conventional radiographs
In traditional radiographs, fluorescent screens absorb the energy in the X-ray
beam that has penetrated the patient. This energy is converted into visible light
that has same information as the original X-ray beam. This light is then used to
expose the X-ray film. The more efficient this light conversion, the less the patient
needs to be exposed to X-ray radiation. In most systems, the film is sandwiched
between two fluorescent screens in a cassette so that the film emulsion is exposed
from both sides. In modern X-ray machines, film is replaced by arrays of
electronic detectors.
Conventional X-ray systems are used to image a variety of patient conditions
including suspected skeletal bone fractures (Figure 17.4.2), cancer in the breast
(Figure 17.4.3) and lung conditions.
Figure 17.4.1
324
Figure 17.4.2
medical
physics
Figure 17.4.4
Figure 17.4.5
Checkpoint 17.4
1
2
3
17
Imaging
with X-rays
Checkpoint 17.5
1
2
Discuss how image intensifiers have changed what doctors can do in diagnosis.
List the benefits of digital technology.
326
Basic design
There are several designs of CAT scanners in use. One commonly used scanner
has the patient placed in a fixed position on a bed. The X-ray tube and detector
array are attached to a movable C-shaped gantry that is rotated 180 around the
patient and then progresses forward and records the next set of scans. Another
more modern system has a moveable bed is that moved slowly through a fixed
toroidal structure (a donut) that houses the X-ray tube and detector. As the bed
progresses, the X-ray tube and detectors continually rotate around the toroidal
structure, scanning the patient in a spiral motion.
medical
physics
X-ray
tube
45
n
sc
a
13
detectors
detector
array
c
fan
beam
direction of
X-ray tube
motion
patient
Figure 17.6.2
n
180
X-ray source
collimator
X-ray
source
scan 90
a
sc
detector
A Long Wait
X-ray
source
ring of
stationary
detectors
The history of CAT: (a) A pencil-beam X-ray source and a detector scan linearly
across the patient from a series of angles. (b) A fan-beam X-ray source and an arc
of detectors rotate around the patient. (c) Only the fan-beam X-ray source rotates;
the detector ring is stationary.
X-rays
X-rays
5 4 2 4 5
10
9
7
9
10
9
8
6
8
9
7
6
4
6
7
9
8
6
8
9
5
4
2
4
5
10
9
7
9
10
Figure 17.6.3
327
17
Imaging
with X-rays
The image generated shows some sort of cross structure but it is fuzzy. Now
if we reassign the grey scale and assign light grey to all voxel values greater than 6
and a dark grey to all voxels with values of 6 or less, we obtain a clearer image of
the original structure (Figure 17.6.3d).
This is the basic process of a CAT scan. For a real CAT scan in which there
are complex sets of absorption calculations, a lot of data processing is required to
recreate a high-resolution image from the data within a second. Figure 17.6.4
shows a single two-dimensional slice from a scan.
Because the data contains three-dimensional information, the imaging
software can combine a series of slices to produce a three-dimensional image.
Computer imaging software can also remove distracting tissues from the scan
images and generate an image that shows the specific organ, tissue or structure
under examination. Figure 17.6.5 shows a CAT angiogram of the blood vessels
in the brain, with the surrounding tissue removed.
Common applications for CAT scans include:
identifying trauma injuries to the lungs, heart, spleen, kidneys and liver
planning for and assessing the results of surgery
planning radiation treatments for tumours
detecting osteoporosis by measuring bone density.
Figure 17.6.4
Checkpoint 17.6
1
2
3
4
328
Figure 17.6.5
medical
physics
PRACTICAL
EXPERIENCES
Activity 17.1
Checkpoint 17.7
1
2
Identify the uses of CAT for which a radiograph can not be used.
List problems that CAT scans can overcome that ultrasound imaging can not.
329
17
Imaging
with X-rays
PRACTICAL EXPERIENCES
CHAPTER 17
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
330
Chapter summary
medical
physics
Review questions
Physically speaking
This is a list of topics that have been discussed
throughout this chapter. Create a visual
summary of the concepts in this chapter by
constructing a mind map linking the terms.
Add diagrams where useful.
X-ray
Hard
Soft
Characteristic
Bremsstrahlung
Penetration
ability
Resolution
Diagnostic
Benefits
2D scan
Image
intensifier
331
17
Imaging
with X-rays
Reviewing
1 Outline two ways in which X-rays are produced in an
X-ray tube.
Re
iew
332
Q uesti o
Figure 17.8.1
Imaging
with light
Look and see
Optical fibres have had a great impact on society, although the fibres
themselves are usually hidden from view. One of those impacts is in a
doctors endoscope, a device to peer inside the human body without
major surgery. Optical fibres allow light to be directed onto an area of
interest and then lenses relay an image back to the doctor. Combined
with tiny surgical instruments, these devices allow minimally invasive
surgery to reduce the trauma of surgery, having a radical effect on
the treatment and recovery of patients with problems ranging from
a torn knee cartilage to leaky heart valves.
18
endoscope, endoscopy, total internal
reflection, critical angle, optical fibres,
core, cladding, coherent fibre bundle,
non-coherent fibre bundle, biopsy
18.1 Endoscopy
An endoscope is a medical device used to shine visible light into a patients
body and relay an image out, to allow organs, tissues and cavities to be seen.
Light is directed to the area of interest through a flexible bundle of optical fibres.
Lenses are used to focus an image of the area of interest onto a separate bundle
of optical fibres that transfer the image out of the body. The image is usually
processed electronically to display it on a screen in real time or to record still or
moving images for later use.
The process of using an endoscope to examine a patient is called endoscopy.
The tube-like part of the endoscope, which enters the body, is typically less than a
centimetre in diameter. Depending on the purpose of the endoscopy, the tube is
inserted via a natural opening of the body, or through a small cut made in the skin.
Miniature surgical instruments can be attached to the part that enters the body and
these are controlled from outside the body by a doctor using control wires that pass
through another narrow tube that is also a part of the endoscope. Tissue samples
can be removed from the patient and surgical procedures can be carried out.
The main advantages of endoscopy are that it is minimally invasive
and provides real-time images in true colour. The operator can manipulate the
endoscope to obtain the best view of the area of interest inside the patient.
Figure 18.1.1
333
18
Imaging
with light
PHYSICS FEATURE
Optical fibres
ni
nr
normal
normal
normal
air
nr
air
water
ni
water
air
ni
water
Figure 18.1.2
nr
90
nr
air
nr
ni
water
ni
Light moving from high to low refractive index materials may be totally internally reflected.
jacket
core
core
cladding
cladding
Figure 18.1.3
334
Figure 18.1.4
A cross-section of an optical fibre
medical
physics
The main parts of an endoscope are shown in Figure
18.1.5. The endoscopic tube that is inserted into the patient
typically contains the following parts:
a bundle of optical fibres to transmit light to the point
of observation
a coherent bundle of optical fibres to carry the image of
the tissue to the observer
a system of lenses to focus an image of the tissues under
examination onto the optical fibre bundle. At the viewing
end of the optical fibre bundle, more lenses allow direct
viewing of the image by eye, or connect to a camera that
feeds a video screen or computer
suction tube to remove blood and other loose, obscuring
tissue material from the area under inspection
an inlet and an outlet to permit the area under observation
to be flushed with clear saline (salt water) solution to
increase visibility
control lines to manipulate the tube inside the patient.
These vary in complexity, depending on the task undertaken.
A simple gastroscopy may not require complex controls.
Endoscopes used for surgical procedures may require fine
control over the position of the lens at the tissue end
miniature remote-controlled surgical instruments may be
present, ranging from simple suction tools to more
elaborate surgical tools used in operations.
power source
video out etc.
The endoscope
is a thin,
flexible fibreoptic telescope.
light to
see inside
Various devices
can be passed
down side channels.
These can be
manipulated by
the doctor to take
specimens etc.
doctor looks
down endoscpe
oesophagus
stomach
duodenum
Figure 18.1.5
Worked example
Question
Calculate the critical angle for a glass optical fibre with a core refractive index of 1.48
and a cladding refractive index of 1.46.
PRACTICAL
EXPERIENCES
Activity 18.1
Solution
The critical angle will be a value of i such that the angle of refraction is 90.
ni = 1.48, nr = 1.46, r = 90
Snells law states:
ni sini = nr sinr
n sin r
i = sin1 r
ni
1.46 sin 90
= 80.5
i = sin1
1.48
18
Imaging
with light
is said to be a coherent fibre bundle. The light from an object projected onto
the ends of the fibre bundle by a simple lens will travel along the fibres and
emerge as a corresponding image of the object at the other end of the bundle
(Figure 18.1.6). This is the principle of the medical endoscope.
In a non-coherent fibre bundle, one or more of the fibres swap positions
relative to each other at opposite ends of the fibre. Typically, the arrangement of
fibres along the bundle is random, so that although the light travels along each
fibre, no clear image is produced at the exit end (Figure 18.1.7). A non-coherent
fibre bundle is adequate to simply transmit light to the point where observations
are being made with the endoscope. White light is used so that the doctor can
observe the tissues in true colour.
object
object
no image
image
1
2
3
4
5
1
2
3
4
5
Figure 18.1.6
1
2
3
4
5
Figure 18.1.7
2
4
5
1
3
Checkpoint 18.1
1
2
3
4
Outline how an optical fibre allows you to see something inside the body.
Define total internal refraction.
Explain why infra-red light is used in communication applications but visible light is used in medical applications.
Identify the difference between coherent and non-coherent bundles of fibres.
medical
physics
A risk in using an endoscope is that the part of the endoscope inside the body
can tear tissues while it is being moved about. Endoscopic examination of the
bowel presents a particular risk, because the bowel contains bacteria, which, if they
enter the bloodstream, can produce a fatal infection. However, the risk in using
an endoscope is much less than the risk encountered if the abdomen had to
be opened up in conventional surgery. The fact that a patient usually has to be
sedated or anaesthetised presents another minor risk to the patient, although this
type of risk is the lesser problem when weighed against the alternative of an
undiagnosed or untreated problem.
Endoscopes that have been modified with surgical instruments can be used
to remove tissue samples for testing. This process is called a biopsy, and this is
one of the most common endoscopic procedures.
A common example of a biopsy is the removal of polyps or other growths from
the intestine for further examination and testing. Endoscopy reduces this risk because
the incisions and amount of cutting is minimised. Figure 18.2.1 shows a biopsy
being performed within the abdomen. The sample of tissue cut off can then be placed
or sucked into a tube attached to the endoscope and withdrawn from the body.
Minimally invasive surgery is conducted using optical fibre instruments that
are often an integral part of the endoscope. Surgery that is commonly carried out
with the aid of an endoscope includes removal of the gall bladder and the prostate,
and repairs to damaged tissues in joints. A common joint operation is the repair
of the anterior cruciate ligament in the knee (Figure 18.2.2). This part of the
anatomy is frequently torn in sports such as netball and football, which involve
vigorous twisting forces on the knees. People once condemned to months off the
sporting field by knee injuries are now returning to their sport within weeks,
because of endoscope-aided surgery.
Figure 18.2.1
Figure 18.2.2
Developments in endoscopy
optical dome
lens holder
illuminating LEDs
lens
battery
antenna
Figure 18.2.3
An endoscopic capsule
Checkpoint 18.2
1
2
18
Imaging
with light
PRACTICAL EXPERIENCES
CHAPTER 18
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Perform a first-hand
investigation to demonstrate
the transfer of light by optical
fibres.
Gather secondary information
to observe internal organs
from images produced
by an endoscope.
Figure 18.3.1
338
Chapter summary
medical
physics
Review questions
Physically Speaking
Solving problems
Reviewing
1 Explain the importance of total internal reflection to
the operation of an endoscope.
Re
iew
Q uesti o
internal organs.
339
19
radioactive decay, radiation,
radioactive, radioisotopes, nucleons,
atomic number, mass number, isotopes,
alpha decay, alpha particle, beta decay,
beta particle, positron decay, antiparticle,
positron emission tomography (PET),
gamma decay, half-life,
radiopharmaceuticals, nuclear reactor,
cyclotron, gamma camera, bone scan,
collimator, scintillator, single-photon
emission computed tomography (SPECT)
Imaging with
gamma rays
Radioactivity can be good!
Images made using ultrasound, X-rays and visible light can show
anatomical structures rather well. They all involve sending various
forms of energy into the body. A rather different approach is to
introduce radioactive elements into a persons body and study the
radiation that emerges. Images can be made of the bones as well
as soft tissues including the brain, heart, liver and thyroid. Rather
remarkably, this approach allows the production of images from
outside the body that show how a persons organs are functioning.
Here we consider two of these minimally invasive but powerful
diagnostic tools: bone scans using radioactive tracers
and positron emission tomography.
340
For the 30 or so lightest elements, the number of protons is roughly the same
as the number of neutrons in the nucleus in most of their naturally occurring
isotopes. These isotopes are stable. However, many elements have isotopes whose
nuclei have too few or too many neutrons. These isotopes are unstable and
undergo radioactive decay in which they change and emit radiation. The type
of radiation that is emitted depends on the nature of the decay (see in2 Physics
@ Preliminary section 15.5).
There are 82 elements that have at least one stable isotope. The stability
depends on the ratio of protons to neutrons. As the atomic number increases,
the ratio of neutrons to protons needed for stability also increases.
Many elements have naturally occurring unstable isotopes. These are
called radioactive isotopes or radioisotopes. The nucleus of a radioisotope
(the parent nucleus) usually transforms itself into another nucleus (the daughter
nucleus) by emitting particles and energy. It will decay repeatedly until it forms
a daughter nucleus that is stable.
medical
physics
increasing
distance
Isotopes
maximum
shielding
Figure 19.1.1
Alpha decay
Some unstable nuclei decay by emitting a particle that
contains two protons and two neutrons in a process known as
alpha decay. The remaining nucleus has a mass number that is
reduced by 4 and an atomic number that is reduced by 2. This
particle emitted from the nucleus is called an alpha particle
(-particle). Alpha particles are helium nuclei ( 42 He ) and they
rapidly become helium atoms, as they gain electrons from the
surroundings. Such reactions are the source of most of the
helium on Earth.
For example, radioactive uranium-238 undergoes alpha decay
to produce thorium-234. The daughter nucleus has 2 protons
less than the parent nucleus and so it is a different element. In
a nuclear reaction, both mass number and charge are conserved,
and the decay process can be described by an equation:
238
92 U
234
90Th
electron
proton
Beta decay
When a radioactive nucleus undergoes beta decay, a
neutron changes into a proton, releasing a high-energy electron
in the process. The electron is ejected from the nucleus with such
a high velocity that it totally escapes the atom. An electron
(represented as e or 10e) emitted from the nucleus in this way is
called a beta particle (-particle). An electron has only 1/1836
1
1H hydrogen
neutron
2 neutrons
+ 42 He
proton
electron
Figure 19.1.2
proton
3
1H tritium
electron
2
1H deuterium
daughter nucleus
Th-234
parent nucleus
U-238
Figure 19.1.3
4
2He
alpha particle
(helium nucleus)
341
19
Imaging with
gamma rays
14
7N
14
6C
daughter
nucleus
parent
nucleus
Figure 19.1.4
beta
particle
of the mass of the proton, and its mass number is 0. Its negative charge gives it
an atomic number of 1.
Beta decay increases the atomic number of the nucleus by one but the mass
number does not change. There is one less neutron but one more proton and
hence the total number of nucleons does not change.
Carbon-14 decays by beta decay to nitrogen-14. The process can be described
using an equation or a diagram (Figure 19.1.4).
Carbon-14 decays to
nitrogen-14 and an electron.
14
6C
147 N +
0
1 e
The two products are a stable nitrogen nucleus and a beta particle (electron).
Beta particles have a range that depends on their energy. In air they travel several
metres, but they travel only a few millimetres in human tissues.
Some nuclei of radioactive elements are unstable because they have too many
These become
protons in the nucleus, relative to the number of neutrons.
more stable by a form of beta decay called positron decay. A positron
0
(represented as e+ or + or sometimes +1 e ) is the antiparticle of the electron.
It has the same mass as an electron and the same magnitude charge as the
electron, but it is positively charged. In positron decay, a proton in the nucleus
decays to a neutron and a positron. The atomic number of the decaying nucleus
decreases by one but the mass number remains the same. The positron is ejected
with high kinetic energy from the nucleus.
The most important positron-emitting isotope is fluorine-18, an artificially
produced isotope. Its decay equation is:
18
18
0
9 F 8 O + +1 e
positron
emitter
e+
Figure 19.1.5
342
e +
e
gamma ray
photon
The most widely used medical isotope is technetium-99. It is a beta emitter that
decays to ruthenium-99. The higher energy form of technetium-99, called
metastable technetium-99 or technetium-99m, is also unstable and becomes
more stable by emitting electromagnetic radiation. At the energy involved, this
The gamma ray carries away no charge or mass
radiation is a gamma ray.
and so the nucleus remains technetium-99. It is the gamma decay of
technetium-99m that is the medically important decay.
medical
physics
Gamma decay also frequently accompanies alpha or beta decay. Cobalt-60 is
a well-known beta and gamma emitter.
Gamma rays have no charge and so can pass easily through matter. To achieve
the 95% absorption of the gamma energy from a cobalt-60 source, the rays must
pass through 60 mm of lead, 100 mm of iron, or 330 mm of concrete.
alpha
beta
X-ray (medical)
gamma ray
neutrons
concrete
human hand
Figure 19.1.6
aluminium
thin
lead
thicker
lead
Checkpoint 19.1
1
2
3
Atom-sized
energy units
19.2 Half-life
Amount (grams)
The time it takes for half the mass of the parent isotope to decay into
Outline properties of
daughter nuclei is defined as the half-life of the isotope. After one half-life, only
radioactive isotopes and their
50% of the original parent isotope remains; 50% of that remaining amount
half-lives that are used to
obtain scans of organs.
decays after another half-life, leaving just 25% of the original parent isotope
and so on.
The half-life of a radioactive isotope can be deduced
from a graph showing the mass of the remaining
2000
Strontium-90 decay
radioactive atoms of the element plotted against time. In
1800
Figure 19.2.1, the time taken for 2000 g of strontium-90
1600
to be reduced to 1000 g is 28.1 years, the half-life of
1400
1200
strontium-90. The daughter isotope is yttrium-90, which
1000
rapidly decays to zirconium-90, which is stable.
800
This mathematical model, called exponential decay, is
600
applicable to all forms of radioactive decay. The rate of
400
200
radioactive decay of an isotope is not affected by changes
0
in physical conditions such as temperature or pressure.
0.0 28.1 56.2 84.3 112.4 140.5 168.6 196.7 224.8 252.9 281.0 309.1
Time (years)
The decay rate is unchanged by any chemical reactions
(or compounds in which the radioactive isotope may be
Figure 19.2.1 The mass of strontium-90 remaining versus time,
incorporated). Every radioisotope has its own half-life.
from an original sample of 2000 g
343
19
Imaging with
gamma rays
Checkpoint 19.2
1
2
Define half-life.
Outline how you would find the half-life of an isotope from a graph showing its mass in a sample versus time.
19.3 Radiopharmaceuticals:
targeting tissues and organs
Outline properties of
radioactive isotopes and their
half-lives that are used to
obtain scans of organs.
Radioisotopes are used to produce functional images of the body. They are used
to examine blood flow to the brain, to assess functioning of the liver, lungs, heart
or kidneys, to assess bone damage, and to confirm other diagnostic procedures.
In contrast to the imaging techniques already discussedultrasound images,
X-ray images and CAT scansthe use of radioisotopes can show how the body is
functioning, rather than simply showing detailed images of tissues and organs in
the body. Diseases such as cancer alter chemical processes in the body and images
produced using radioisotopes can reveal these changes. Radioactive chemicals
used in medicine are called radiopharmaceuticals.
344
medical
physics
Radiopharmaceuticals are used in very small quantities for diagnostic work.
Just enough is administered to obtain the required information before the
radiopharmaceutical decays, therefore minimising cell damage from the radiation.
The radiation dose received is similar to that from diagnostic X-rays.
Figure 19.3.1
half-life
Cobalt-60
5.3 years
Iodine-131
8 days
Phosphorus-32
14.3 days
Molybdenum-99
2.75 days
Technetium-99m
6 hours
Emission
,
,
Uses
External beam radiotherapy
Cancer diagnosis and imaging of the thyroid gland
Treatment of excess red blood cells
Parent isotope used in a generator to produce technetium-99m, the most
widely used isotope in nuclear medicine
To image the skeleton and heart muscle in particular, but also for brain,
thyroid, lungs, liver, spleen, kidney, gall bladder, bone marrow, salivary and
lacrimal glands, heart blood pool, infection and numerous specialised
medical studies
Half-life
Carbon-11
20.3 minutes
Emission Uses
+
Nitrogen-13
10 minutes
Oxygen-15
2.03 minutes
Fluorine-18
109.8 minutes
Iodine-124
4.5 days
+
+
+
+
345
19
Imaging with
gamma rays
PHYSICS FEATURE
Technetium-99m
99 m
99
42Mo 43 Tc
99 m
43 Tc
99
43Tc +
Checkpoint 19.3
1
2
3
4
gamma photons
Figure 19.4.1
346
Principle of a
gamma camera
medical
physics
directions. Unlike visible light, gamma rays cannot be focused to form an image.
Instead a lead collimator with parallel holes passing from the bottom to the top is
used to allow only those gamma rays directed through the holes to
reach the detector itself.
The detector is a scintillator, made of a crystal of sodium iodide containing
traces of thallium. The thallium impurity causes the crystal to scintillate
(i.e. to emit a small flash of light) when a gamma ray enters the crystal. The flash
is very faint, so sensitive devices called photomultipliers are used to detect the
flash and amplify its effects, converting the result into an electrical signal.
A computer processes the signals to produce an image that may be viewed
directly on a screen, printed or stored electronically, since the information is
all digitally encoded.
Regions of bone having a high metabolic activity appear darker on the bone
scan because more of the radiopharmaceutical is absorbed (Figure 19.4.2).
Conditions that may cause increased metabolic activity include cancer cells
multiplying in the bone, healing of a fracture or damage from arthritis.
A gamma camera is also used in single-emission computed tomography
(SPECT). The camera acquires two-dimensional images from multiple directions.
These are then assembled into a three-dimensional image in a computer, in a
similar way to other three-dimensional tomographic imaging techniques.
PRACTICAL
EXPERIENCES
Activity 19.1
Checkpoint 19.4
1
2
347
19
Imaging with
gamma rays
seeing words
hearing words
Figure 19.5.1
b
detectors
coincidence
unit
Figure 19.5.2
348
valid
event
(a) A PET scanner showing coincidence detection of gamma rays by opposite detectors and (b) an example of a PET scanner
medical
physics
PET is a valuable diagnostic tool because the images that it produces show
differences in chemical activity in different parts of the body. These differences
may result from normal processes or they may be the result of diseases such as
cancer or other functional abnormalities in the body. The great diagnostic
benefit of PET results from the fact that it is possible to incorporate positron
emitters into a wide variety of radiopharmaceuticals which can be produced to
target specific organs or chemical processes in the body.
PET has provided new knowledge, in particular about the functions of the
normal brain, as well as of diseases that affect the brain. This knowledge has also
greatly modified brain surgery.
Figure 19.5.3
PRACTICAL
EXPERIENCES
Activity 19.2
In this false colour PET scan (centre) two regions of high metabolic activity typical
of cancerous tumours are indicated by the bright white hot-spots (in the spine and
the prostate). This image is combined with a CAT image (left) to produce the fused
image (right) that accurately indicates their location.
Checkpoint 19.5
1
2
3
Explain what features in the body PET scans can be used to identify that X-rays and ultrasound can not.
Outline how the radioisotopes for a PET scan can be introduced into the body.
Explain how coincidence pairs in a PET scanner allow an image to be constructed.
349
19
Imaging with
gamma rays
PRACTICAL EXPERIENCES
CHAPTER 19
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
A bone scan is performed to obtain a functional image of the bones and so can be
used to detect abnormal metabolism in the bones, which may be an indication of
cancer or other abnormality. Because cancer mostly involves a higher than normal
rate of cell division (thus producing a tumour), chemicals
involved in metabolic processes in bone tend to accumulate in
higher concentrations in cancerous tissue. This produces areas
of concentration of gamma emission, indicating a tumour.
Compare the data obtained from the image of a bone scan
with that provided by an X-ray image.
Discussion questions
1 Identify the best part of the body for each of these
diagnostic tools to image.
2 Compare and contrast the two images in terms of
the information they provide.
Figure 19.6.1
Figure 19.6.2
350
Chapter summary
medical
physics
Review questions
Physically Speaking
Reviewing
Radioactive
decay
Radioisotope
Nucleon
Proton
Isotope
Alpha decay
Gamma
decay
Antimatter
PET
Half-life
Bone scan
Positron
decay
Scintillator
Radiation emitted
Half-life
C-11
Tc-99m
TI-201
I-131
Cs-137
U-238
b+, g
g
g
b, g
a
a
20.30 minutes
6.02 hours
3.05 days
8.04 days
30.17 years
4.47 109 years
351
19
Imaging with
gamma rays
positrons.
Solving problems
17
Figure 19.6.3
146
234Th
144
234U
230Th
140
226Ra
138
222Rn
136
218Po
134
214Pb
130
210TI
128
218At
214Bi
214Po
210Pb
126
210Bi
210Po
206Pb
124
122
78
238U
234Pa
142
132
80
82
Figure 19.6.5
84
86
88
90
94 Z
92
50.0%
25.0%
12.5%
Re
iew
352
Q uesti o
6.3%
0
Figure 19.6.4
Figure 19.6.6
8
Years
10
12
3.1%
14
16
medical
physics
PHYSICS FOCUS
ANSTO Radiopharmaceutical
production
Lucas Heights production facility
The Lucas Heights Facility in Sydney, operated by
ANSTO, manufactures radiopharmaceuticals such as
molybdenum-99 generators for technetium-99m and
iodine-131, together with other associated products
including chromium-51, thallium and gallium. The
facilities are ideal for simultaneous production of large
quantities of different isotopes used for diagnosis and
therapy. An average of 3500 medical isotopes and
2500 packages are dispatched per month. The
packaging, transport and delivery of all products
comply with strict national and international
regulations.
Molybdenumplant
ANSTOHealth is in the process of commissioning
a new plant that will be used to manufacture
molybdenum-99. ANSTO is currently importing this
product from overseas to meet domestic needs;
however, this is costly (as a heavy airfreight product)
and shipment is not always reliable, which emphasises
the importance of local Australian production.
Molybdenum-99 is used as a raw material for 80%
of nuclear medicine procedures performed around
the world.
Molybdenum-99 is formed by the fission of
uranium-235 that is itself formed by irradiation of low
enriched uranium (LEU) in ANSTOs nuclear reactor.
The molybdenum-99 is then used as a generator for
technetium-99m, which can then be formulated into
a plethora of radiopharaceuticals.
The new plant is located behind shielding because
of radiation, and has been carefully designed to
protect workers and the external environment during
and after processing.
Extension
4 The OPAL nuclear reactor opened in 2007,
replacing the earlier HIFAR reactor. Some critics
of the nuclear reactor program claim a new reactor
was unnecessary.
353
20
spin, magnetic moment, parallel,
antiparallel, precession, Larmor
frequency, radio frequency (RF),
magnetic resonance imaging (MRI),
resonance, relaxation, longitudinal
relaxation time constant, transverse
relaxation time constant,
RF transceiver coils, gradient coils,
functional MRI,
Imaging with
radio waves
Hydrogen calling
Magnetic resonance imaging (MRI) uses radio waves emitted by
hydrogen atoms to produce high-resolution images of the body.
Magnetic resonance has been used as a tool for studying atomic
structure since 1946, but it was not until 1973 that medical imaging
was suggested, and another 15 years before useful images were
obtained. Like CAT scans, MRI can produce tomographic images
(slices) of the body, enabling detailed two- and three-dimensional
images to be constructed from the data. Magnetic resonance imaging
was initially used to produce images showing structural detail;
however, recent developments have led to functional MRI,
which has greatly advanced knowledge of how the
brain and other organs work.
spin vector
protons
spin vector
354
Figure 20.1.1
medical
physics
Spin produces a magnetic effect called a magnetic moment or
Identify that the nuclei of
magnetic dipole. You can think of this as being similar to the magnetic field
certain atoms and molecules
produced by moving charge (a current) around the wire of a solenoid (see Figure
behave as small magnets.
4.1.10). Therefore, charged particles such as electrons and protons behave like
tiny magnets. Even uncharged neutrons have a magnetic moment, because they
contain fundamental charged particles called quarks (see section 15.5). Nuclei
with an odd number of nucleons will therefore have a net spin and always have
an unpaired magnetic moment and so behave like tiny magnets. When placed in
a strong magnetic field these nuclei behave a little like a compass needle in the
A compass needle always points north, but the
Earths magnetic field.
Explain that the behaviour of
nuclei magnetic moments may assume one of two possible alignmentsthey can
nuclei with a net spin, particularly
align parallel or antiparallel to the magnetic field. External magnetic fields or
hydrogen, is related to the
magnetic field they produce.
the magnetic field of electromagnetic radiation may affect the nuclei because
of this magnetic moment.
north
Hydrogen is the most important element in the
process of magnetic resonance imaging because of
magnetic
the magnetic properties of the nucleus and the fact
moment
that it is present in water molecules, proteins, fats
and carbohydratesin fact, in most of the
molecules in the body. Since it normally has just
a single proton forming the nucleus, a hydrogen
nucleus will clearly have a magnetic moment. The
magnetic properties of hydrogen are not normally
evident, because the magnetic moments of the
south
nuclei are randomly aligned, resulting in a zero
net magnetic effect (Figure 20.1.2).
Figure 20.1.2 The net spin of hydrogen results in a magnetic moment that is often
not apparent in large groups of hydrogen nuclei.
Checkpoint 20.1
1
2
3
4
355
20
Imaging with
radio waves
no external field
therefore slightly preferredbut only about 0.0003% more protons favour the
parallel configuration. Injecting the energy difference that corresponds to a photon
with a radio frequency of 42.6 MHz can cause the proton to flip from the parallel
to the antiparallel configuration. This same energy will be released if it flips back.
When the applied external field is removed, the protons become randomly
orientated again because of collisions occurring between the randomly moving
atoms.
Precession
external magnetic field
parallel
antiparallel
Figure 20.2.1
Randomly orientated
hydrogen nuclei align to
an externally applied
magnetic field.
h
where h is Plancks constant.
For a proton in a 1 T magnetic field,
the Larmor frequency is 42.6 MHz,
corresponding to radio frequency (RF)
electromagnetic waves.
Try this!
Figure 20.2.2
Precession of a hydrogen
nuclei around the applied
magnetic field
precession
A spinning top
When you try to push over a childs spinning top, it doesnt fall
over. Try it! Instead, the tops axis of rotation itself starts to
rotate around the vertical axisit precesses (at least until
frictional forces cause it to tip too far and it hits the ground).
The force of gravity produces a twisting effect (a torque) on
the top. If the top werent spinning, this would cause it to
simply fall over. Precession is the motion that results when the
top is spinning. The axis of rotation of the top sweeps out
a conical motion.
356
rotation
force of gravity
Figure 20.2.3
medical
physics
Checkpoint 20.2
1
2
3
PRACTICAL
EXPERIENCES
Activity 20.1
357
20
Imaging with
radio waves
Resonance
nut
1st fret
5th fret
Figure 20.3.3
Longitudinal magnetisation
Relaxation
100%
Time
T1
Figure 20.3.4
358
T1 (ms)
520
390
2000
180
600
800
2500
T2 (ms)
100
90
300
90
40
180
2500
medical
physics
100%
Transverse magnetisation
Figure 20.3.5
Checkpoint 20.3
1
2
3
4
Bxy
T2
Bxy
Time
Bxy
Figure 20.4.1
359
20
Imaging with
radio waves
70.6
84.3
80
93
12.2
Checkpoint 20.4
1
2
3
4
Figure 20.4.2
PRACTICAL
EXPERIENCES
Activity 20.2
360
medical
physics
RF
transceiver
Y coil
Z coil
patient
patient
table
gradient
coils
X coil
superconducting
magnet coils
scanner
RF transceiver
X
Figure 20.5.1
Figure 20.5.2
Checkpoint 20.5
Explain why the cost of running MRI scanners is so high.
patient
Figure 20.5.3
PRACTICAL
EXPERIENCES
Activity 20.3
361
20
Imaging with
radio waves
Figure 20.6.1
One of the first highly successful applications of MRI was its use in diagnosing
multiple sclerosis, an incurable disease that affects the nervous system. MRI clearly
shows the scar-like plaque resulting from this disease, marking the destruction of
the insulating myelin sheaths around the axons of nerve cells (Figure 20.6.1),
something other imaging technologies are unable to do. Other examples of MRI
imaging are shown in Figures 20.6.2 and 20.6.3.
MRI has changed scientific thinking about the progress of diseases such as
cancer and multiple sclerosis, because these diseases can now be detected earlier
and different treatments can then be used. MRI also allows the progress of a disease
and its treatment to be monitored more safely, increasing the chances of survival.
False-colour MRI of the brain
of a person with multiple
sclerosis shows regions
of myelin destruction
(blackorange).
Advantages of MRI
MRI is non-invasive and has no known side effects, although a few patients
experience anxiety in the confined, noisy space of the MRI scanner.
MRI provides excellent soft-tissue imaging, providing better contrast than CAT
or conventional X-rays and much better resolution than ultrasound.
Image contrast can be altered by changing the T1 and T2 weightings.
There are few cases of adverse reaction to MRI contrast agents.
Disadvantages of MRI
Figure 20.6.2
Patients with heart pacemakers or metal parts in their bodies cannot be imaged
because of the strong magnetic fields.
Scans may take up to an hour to complete.
MRI scanners are very expensive and the running costs are high.
MRI scanners are very sensitive and must be screened from outside radio and
magnetic interference.
The strong magnetic field produced can interfere with nearby electronic
equipment and is dangerous in the presence of iron tools or furniture.
Most current MRI images show structure within the body. An important
area of technical development is the production of functional MRI images that
reveal differences in chemical processes happening in the body. Other
developments include the use of even stronger magnetic fields, allowing better
resolution and contrast, and ultimately imaging using elements other
than hydrogen.
The possibility of future high temperature superconductors, requiring only
liquid nitrogen cooling, could impact significantly on MRI technology by
removing the need for expensive, non-renewable liquid helium. Research and
development has already produced portable MRI scanners for specialised
applications. These machines do not require the use of superconducting
magnets, greatly reducing the size and cost of the machine.
Checkpoint 20.6
1
Figure 20.6.3
362
PRACTICAL EXPERIENCES
medical
physics
CHAPTER 20
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Perform an investigation to
observe images from magnetic
resonance image (MRI) scans,
including a comparison of
healthy and damaged tissue.
Identify data sources, gather,
process and present information
using available evidence to
explain why MRI scans can be
used to:
detect cancerous tissues
identify areas of high blood
flow
distinguish between grey and
white matter in the brain.
Gather and process secondary
information to identify the
function of the electromagnet,
radio frequency oscillator, radio
receiver and computer in the
MRI equipment.
363
20
Imaging with
radio waves
Chapter summary
f Larmor =
h
Review questions
Physically Speaking
Complete the passage below by filling in the missing words
from the list provided.
antiparallel, field direction, energy, flip, higher energy,
intensity, relaxation time constants, magnetic resonance
imaging, quantum mechanical, parallel, protons, model,
radio, randomly, reality, rotation, magnetic field, spin
364
medical
physics
Reviewing
1 Explain why fluorine-19 has a nuclear magnetic
moment, but nitrogen14 does not.
MRI machine.
Solving problems
18 When a 1 T field is applied to a portion of the body
containing 2 million protons, estimate how many
extra protons favour the parallel configuration over
the antiparallel configuration.
iew
Q uesti o
MRI images show a normal brain (left) and the brain of a patient with a tumor.
Re
Figure 20.7.1
365
Multiple choice
(1 mark each)
1 Two differences between the normal hearing range
of sound and ultrasound are:
A
B
C
D
Substance
Normal sound
Ultrasound
Easily scattered by
human tissue, above
20kHz
Longer wavelength,
inaudible
Above 20kHz, shorter
wavelength
20 Hz to 20kHz,
inaudible
Fat
Kidney
A 0.29%
B 1%
Ultrasound
velocity (m s1)
Impedance
(rayl)
920
1450
1.33 106
1040
1561
1.62 106
C 12%
D 18%
Figure 20.8.1
366
Density
(kg m3)
medical
physics
Short response
Extended response
10
Radioactive
source
Radiation
emitted
Half-life
C-11
b+, g
20.30 minutes
Tc-99m
6.02 hours
I-131
b, g
8.04 days
Cs-137
30.17 years
U-238
Figure 20.8.2
367
6
CONTEXT
Astrophysics
I render infinite thanks to God, for being so kind as to make me alone the first
observer of marvels kept hidden in obscurity for all previous centuries.
Galileo Galilei in Siderius Nuncius (Sidereal Messenger)
368
The Italian scientist Galileo Galilei (15641642) was not quite right in his claim to
be the first to see the universe revealed by the newly invented telescope. That
accolade may belong to Englishman Thomas Harriot (15601621). However, it was
certainly Galileos short record of his observations in Siderius Nuncius (Sidereal
Messenger), published in March 1610, that announced a revolution in astronomy to
the educated world in Europe. Observations with telescopes revealed previously
unknown objects, unimagined by thousands of years of naked-eye observers.
Nonetheless, astronomy remained largely a collectors science for another two
centuries. Observers used telescopes of increasing size to discover objects and note
their positions and details, but knew little of their true nature. It was not until the
early years of the 19th century that physicists discovered lines in the spectrum of
sunlight, and later realised that they were the fingerprints of the elements making
up the Sun and the stars. Although Newtons law of universal gravitation explained
motion in the sky, it was the understanding of spectral lines that really put physics
into astronomy. Spectroscopy remains at the heart of modern astrophysics.
In this module we will explore some aspects of both gravity and spectroscopy in
astrophysics, leading to our current understanding of the life history of stars.
INQUIRY ACTIVITY
Learn about a star
A lot of astronomical data gathered by astronomers is now available on-line in
various catalogues. One important catalogue of stars was produced by the
Hipparcos satellite in the early 1990s. It observed the positions and basic
observable properties of more than 100000 stars with high precision and more
than 2.5 million stars to lower precision.
In this exercise, you will use the Hipparcos catalogue to find out about one
of the bright stars of the Southern Cross (the constellation Crux) or a neighbour
in the constellation of Centaurus.
1 Find a map of Crux and Centaurus,
perhaps in a star atlas or a computer
program that shows the stars (e.g. Google
Sky or WikiSky). Youll need to orient the
map to match Figure 21.0.2 above.
2 Identify the brightest stars of Crux and
Centaurus seen in Figure 21.0.2.
3 Select one of these stars and look it up
in the Hipparcos online catalogue, which
can be accessed on the companion
website at Pearson Places
<www.pearsonplaces.com.au>.
To do this, enter the name
(e.g. Alpha Crucis) into the Target Name
field and then click on Submit Query.
The database should respond with a single
line of data about your star.
369
21
Eyes on
the sky
Who needs a telescope?
Looking at the sky with the naked eye involves an optical system with
a maximum diameter of about 7mmthe size of the pupil of your
dark-adapted eye. The brightness of the image and the ability to
resolve fine detail are both set by that size. Using a telescope
increases the diameter of the light-collecting aperture to perhaps
60mm in a small backyard telescope, or 10m in the largest optical
telescopes currently in use. The aperture is the most important
property of any telescope operating at any wavelength because it
governs the ability to see faint objects and resolve fine detail.
astrophysics
What is
magnification?
he magnification of an optical
telescope describes how much
bigger an object appears in the
telescope compared to the nakedeye view. Technically, it is the
ratio of the angular size of the
object with and without the
telescope. A pair of 7 50
binoculars, for example, has an
angular magnification of 7 times
(7) and 50mm diameter main
lenses. The view through the
binoculars is as if the object was
only 1/7th as far away.
Figure 21.1.2 One of Galileos drawings of the Moon compared with a photograph
He also saw four stars moving back and forth relative to Jupiter in a pattern
he recognised as orbital motion around Jupiter. The observation of these
Galilean satellites almost led to the discovery of the planet Neptune, more than
two centuries before its actually discovery.
Galileos instruments were refracting telescopes that used a planoconvex lens (Figure 21.1.3) as the objective lens. The properties of the glass
and the shape of this lens bend the light and bring it to a focus (Figure 21.1.4).
The smaller eyepiece lens then relays the light to your eye, which creates a new
image on the retina at the back of the eyeand you see the object!
Galileos telescopes used a plano-concave lens (Figure 21.1.3) as the eyepiece.
This results in an image that is upright; but the telescope then has a very narrow
field of viewthe area of sky you can see at any momenta little like looking
down a narrow pipe at the object.
In 1611, Johannes Kepler (15711630) pointed out that a plano-convex
eyepiece lens would also work (Figure 21.1.4), although the image was inverted.
Christoph Scheiner (15731650) popularised this form of Keplerian refractor
by pointing out that the field of view was larger than in the Galilean telescope
(as illustrated in Figure 21.1.5).
With the field-of-view problem reduced in the Keplerian (or astronomical)
refractor, a race began for higher magnification. This was most easily achieved by
increasing the focal length of the objective lensthe distance between the lens
and the image it forms (see section 21.3 for more on calculating magnification).
By the 1670s, long telescopes achieved magnifications of more than 100, but at
the cost of enormous size and resulting difficulty in mounting and operation.
plano-convex
plano-concave
21
Keplerian telescope
objective lens
focus
to eye
starlight
eyepiece
focal length
b
Galilean telescope
objective lens
to eye
starlight
eyepiece
focal length
Figure 21.1.4 (a) Keplerian and (b) Galilean telescopes using an objective lens
Newtonian telescope
to eye
flat secondary
mirror
Cassegrain telescope
hyperbolic
secondary mirror
eyepiece
prime
focus
parabolic
primary
mirror
starlight
eyepiece
prime
focus
to
eye
starlight
focal length
focal length
Figure 21.1.7 Newtonian and Cassegrain telescopes using a primary mirror of the same focal length for easy comparison.
In large telescopes, if no secondary mirror is in place, the prime focus position can be used.
372
parabolic
primary
mirror
astrophysics
apertures were available, and in 1789 William Herschel (17381822) built a
reflector with a 124cm aperture! This was the first giant reflectorthe
forerunner of the 8 and 10m aperture reflecting telescopes of today.
Checkpoint 21.1
1
2
3
Identify what sets the light-gathering ability and resolving power of an optical system.
Outline key astronomical observations made by Galileo.
Describe chromatic aberration and its cause.
21.2 Looking up
For more than 200 years from the time of Galileo, all telescopes were optical
telescopes designed to focus visible light onto the retina of the human eye.
From about 1840, the use of photography revolutionised astronomy by allowing
images of the sky to be permanently recorded. For the next century telescopes
grew in size but remained fundamentally unchanged.
Discuss why some wavebands
In 1933, Karl Jansky (19051950), a physicist working for Bell Telephone
can be more easily detected
Laboratories in New Jersey, realised that part of the background hiss heard in
from space.
Discuss the problems associated
radio communication was coming from the sky. Janksys discovery was followed
with ground-based astronomy
up in 1937 by Grote Reber (19112002), who built a reflecting telescope in his
in terms of resolution and
backyard in Chicago (Figure 21.2.1). This was a radio telescope, with a mirror
absorption of radiation and
made from sheet metal and a diameter of 9.5malmost four times the size of
atmospheric distortion.
the largest optical reflecting telescope of the era. The basic
optical principles of Rebers radio telescope were exactly the
same as those of an optical reflecting telescope, with the
exception that the radio light was detected by a radio receiver
rather than by eye or photography.
Radio astronomy really began to develop after World
WarII, spurred by technical developments during the war
This was possible because a wide range of radio
years.
wavelengths, as well as visible light, penetrate the Earths
atmosphere without significant absorption and can therefore
be observed by telescopes on the ground. Electromagnetic
(EM) radiation of most other wavelengths is absorbed by the
atmosphere before it reaches the ground (Figure 21.2.2).
At wavelengths longer than about 10m, the Earths
ionosphere (see in2 Physics @ Preliminary section 8.4) blocks
radio energy from space. The radio window opens at metre
wavelengths and remains open down to millimetre wavelengths
for telescopes on the ground. However, observing at these
shortest radio wavelengths requires a high, dry location such as
the site of the ALMA mmsub-mm radio telescope, under
construction in the Atacama Desert in Chile.
Figure 21.2.1 Grote Rebers radio telescope
373
Antarctic
astronomy
Absorption by atmosphere
21
Galex
Fermi
Spitzer
Hubble
100%
50%
0%
Parkes
Gemini
0.1
nm
1
nm
10
nm
100
nm
1
m
10
100
m
1
mm
1
cm
10
cm
1m
10 m 100 m 1 km
Wavelength
Figure 21.2.2 The Earths atmosphere stops light from many parts of the electromagnetic spectrum
reaching the ground.
Checkpoint 21.2
1
2
Magnification
Perhaps surprisingly, the concept of magnification (sometimes called
power) is not too important in most astronomical telescopes. It is easily
calculated if the focal length of the objective lens or mirror, fo, and the focal
length of the eyepiece, fe, are known. The magnificationm is then given by:
m=
374
fo
fe
astrophysics
Worked example
Question
A small refracting telescope has an objective lens diameter D of 70mm and focal length fo
of 700mm.
If this telescope has a typical low power eyepiece focal length fe of 25mm, what would
be its magnification?
Solution
m =
fo
fe
700
= 28 (i.e. 28)
25
Sensitivity
The sensitivity of a telescope system describes its ability to see faint objects.
Sensitivity is sometimes called light-gathering power. It depends on how much
light the telescope collects and how much of that light is delivered to the detector.
The primary factor controlling light-gathering power is the diameter D of the
telescopes objective lens or mirror. Most telescopes have circular mirrors and so
the collecting area is proportional to D2. A 10 m diameter telescope therefore
collects 100 times the light of a 1m diameter telescope. This is the primary
reason for building bigger telescopes, although it is often not quite that simple,
as illustrated by the example of the Anglo-Australian Telescope (AAT).
Worked example
Question
What is the collecting area of the 3.9m AAT, and how does it compare with that of a
70mm diameter refracting telescope?
Solution
The refracting telescope has D = 70mm, so its collecting area can be calculated using:
Collecting area = p
2
D
The AAT has D = 3.9m; however, it also has an obstruction caused by the secondary
mirror and its housing, as seen in Figure 21.3.1. In the f/8 Cassegrain configuration, this
obstruction is about 20% of the collecting area. So the ratio of the collecting areas of the
two telescopes is:
3900 2
p
0.8
2
Collecting area AAT
=
2
Collecting area 70mm
70
p
2
2500
The AAT has 2500 times the light-gathering power of the 70mm telescope.
Once the light has entered the telescope, it must be delivered to the detector
and this may involve a series of lenses and mirrors. Each of these may reflect,
absorb or scatter a little of the light and reduce the amount that reaches the
detector. In the AAT there are just two mirrors between the sky and a detector
placed at the Cassegrain focus, yet 10% of the light is lost.
21
Apart from the telescope itself, the overall sensitivity also depends on several
other factors:
Atmospheric transmissionFor telescopes on the Earths surface, the
proportion of the light that penetrates the atmosphere depends on
wavelength, as illustrated in Figure 21.2.2.
Detector efficiencyOften much less than 100% of the light striking the
detector is actually recorded.
Observing timeUnlike the human eye, most detectors record more light
if they are left exposed for longer times.
BackgroundThe sky is not truly dark because of natural sources and
man-made pollution at optical and radio wavelengths.
Angular resolution
R2D
D 206265
(Recall the arc second as a unit of angle from in2 Physics @ Preliminary section
13.4.)
The angular size of this pattern depends only on the wavelength of the light
and the diameter of the telescope. For most telescopes, R is much bigger than the
apparent size of even the largest star as seen from Earth.
If two stars are close together, their Airy discs may overlap so that it is
impossible to see that there are two stars. They are unresolved. The stars can
theoretically be resolved and seen as two stars (Figure 21.3.3) if they are
separated by at least
R (in arc seconds) = 1.22
D 206265
Resolution also affects images of larger objects such as the Moon or planets,
since each point of the object acts like a distant star and is blurred into an Airy
disc (Figure 21.3.3).
Figure 21.3.3 A magnified view of an Airy
Figure 21.3.4 Simulated views of Saturn, showing the effect of resolution using (a) a 10cm diameter
telescope and (b) a 30cm diameter telescope
376
astrophysics
Notice that the angular resolution of a telescope is expected to be better
(R is smaller and the image is sharper) for a big telescope. It is also better if the
wavelength is small (e.g. much better for visible light than for radio waves). This
is illustrated by the theoretical resolution values for 10m diameter telescopes
listed in Table 21.3.1. For comparison, the size of the Sun or Moon in the sky is
around 1800arc seconds, while a $1 Australian coin seen from a distance of
100km would appear just 0.05arc seconds across.
Table 21.3.1 Theoretical resolution of 10m diameter telescopes at different wavelengths
Resolution R for D = 10m
Band
Typical wavelength
Ultraviolet (UV)
100nm
0.0025
Optical
500nm
0.013
2000nm (2 m)
0.050
Radio (millimetre)
1mm
25
Radio (centimetre)
21cm
5300
(arc seconds)
PRACTICAL
EXPERIENCES
Activity 21.1
TRY THIS!
Checkpoint 21.3
2
3
4
Active optics
These tight tolerances may be impossible to achieve if the mirrors are so large
that they bend under their own weight. Alternatively, the telescope itself may
bend as it points in different directions. In the past, both effects could largely be
overcome by building massive telescopes. The dish of the Parkes radio telescope
(Figure 21.4.1) is 64m across and weighs 300tonnes, but also has a clever design
to allow for flexure of the structure.
21
Seeing
Discuss the problems associated
with ground-based astronomy
in terms of resolution and
absorption of radiation and
atmospheric distortion.
PRACTICAL
EXPERIENCES
Activity 21.2
378
For all wavelength ranges, coping with the slowly changing distortions in the
telescope itself is an issue. In contrast, the much faster effects of seeing are
problems restricted to ground-based optical and infra-red telescopes.
Seeing refers to the effects on images of tiny, rapidly changing
temperature variations in the Earths atmosphere. These effects distort the path
of the visible and, to a smaller extent, the infra-red light passing through the
atmosphere so that even a perfect optical/IR telescope will not produce a sharp
image. Images in a large telescope are distorted into a constantly moving and
changing speckle pattern that averages over time into a blurred seeing disc
(Figure 21.4.3). The same effects produce the familiar scintillation, or twinkling,
of stars. Scintillation describes rapid changes in the brightness of a star, while
seeing refers to blurring of a star image.
The turbulence that produces seeing is usually concentrated near the ground
and in discrete layers in the atmosphere. It depends on local weather conditions
and geography. The best locations for good seeing are typically tall mountains
that project above the relatively smooth stream of air flowing across open oceans.
Most of the worlds largest optical or IR telescopes are therefore sited on islands,
astrophysics
Figure 21.4.3 Simulated images of a binary star seen through a large telescope in good seeing(a) without seeing effects, (b) the speckle pattern
seen in a 10ms exposure time and (c) the seeing disc seen in longer exposure times. The arrow indicates 1arc second.
science target
laser guide star created
in sodium layer at 90 km
Adaptive optics
Obviously, the ultimate way to beat the seeing effects is to
place a telescope in space, above the atmospheric
turbulence. The 2.4m diameter Hubble Space Telescope
(HST) has spectacularly demonstrated the power of
observing from space since its launch in 1990. However, it
is incredibly expensive and the largest optical/IR telescopes
are on mountain tops, not in space.
For at least 40 years, astronomers and engineers have
been working towards beating the seeing effects for
ground-based telescopes by using adaptive optics.
Adaptive optics is essentially active optics, working much
fastercorrecting hundreds of times a second to overcome
the rapidly changing seeing effects.
An adaptive optics system (Figure 21.4.4) works
by using a wavefront sensor to measure the image
distortion of a target star. A tilting mirror is used to stop
the image moving and a deformable mirror is used to
sharpen the image. The result is a much sharper image,
although still not perfect (Figure 21.4.6).
The target star may not be the real target of scientific
interest, but it must be very close and bright enough for
the adaptive optics system to work. A single system can
measure and correct only a tiny patch of sky, since the
blurring is different just a few arc seconds away. If there is
not a bright enough adaptive optics target star close to the
scientific target, then astronomers can provide one! A laser
guide star is created by firing a high-power laser up from
the telescope, causing the atmosphere to glow at about
90km altitude. To the adaptive optics system it looks like
a star that can be moved to where it is needed.
layers of turbulence
in the atmosphere
laser projection system
telescope
sodium laser
distorted wavefronts
producing a moving
blurred image
tilting
mirror
smoothed wavefronts
producing a sharp image
real-time
control computer
wavefront
sensor
moving, blurred
image of
guide star
observing
instrument
21
All large optical/IR telescopes now have some sort of adaptive optics system.
The next generation of even larger telescopes will rely on multiple adaptive
optics systems and laser guide stars.
Seeing is not
believing!
Figure 21.4.6 The blurred image (left) of the star IW Tauri reveals a pair of stars separated by
0.3arc second (right) when imaged using adaptive optics on the 5m Hale Telescope.
Checkpoint 21.4
1
2
3
4
5
21.5 Interferometry
The goal of adaptive optics is to achieve the resolution limit set by the diameter
of the telescope and the wavelength of the light. Given the long wavelengths of
radio waves, it is impractical to build a single large radio telescope to achieve
The
the resolution possible with even a very small optical telescope.
solution is to use two or more radio telescopes and link them together to form an
interferometer. Provided the effective path to the detector via each telescope is
the same to within a few wavelengths, interference will be seen. The interference
pattern contains information about the image of the astronomical object.
The ability of an interferometer to resolve fine details is governed by the
equation:
R (in arc seconds) = 1.22
D 206265
where D is now the distance between any two telescopes, not their individual
diameters. The resolution of the interferometer is about the same as having a
large telescope of diameter Dbut only in the direction along the line between
the telescopes. At right angles to that direction the resolution is the same as using
only one telescope.
If the interferometer is formed from an array of several telescopes, then each
pair provides good resolution in one direction. With enough telescopes, the array
380
astrophysics
can produce good resolution in many different directionsenough to build up
an image of the target. Also, as the Earth rotates, these directions move relative
to the target and the array can synthesise an even better image.
An interferometer produces the resolution of a single large telescope, but
with much less sensitivity, because only a small proportion of the light is
collected by the array of telescopes (Figure 21.5.1).
Interferometry is fundamental to the operation of almost all large radio
telescopes. The main interferometric array of radio telescopes in Australia is the
Australia Telescope Compact Array (ATCA), located near Narrabri, NSW
(Figure 21.5.2). It has six 22m dishes spread over 6km in a pattern that can be
varied. However, it is sometimes used with other dishes spread around Australia
or the world to form a much larger array.
Interferometry is also now being
applied in optical/IR astronomy. The
Sydney University Stellar
single telescope
mirror of the
Interferometer (Figure 21.5.3), also
same resolution
located at Narrabri, is one of the
pioneering instruments in this field.
The much shorter optical
wavelengths make the tolerances
much tighter on the optical
components and their positions, and
seeing effects are once again a major
problem.
individual telescopes
Figure 21.5.2 Some of the dishes of the Australia Telescope in a compact configuration
2
3
Checkpoint 21.5
1
What is
interferometry?
21
Figure 21.6.1 (a) A composite image of the nearby active galaxy Centaurus-A, with (b) an X-ray image and
(c) a radio image revealing much more than is seen in (d) the optical image they overlay.
Checkpoint 21.6
Contrast the maximum theoretical resolution of a 42m diameter ELT, the JWST and the SKA. (You will need to assume
some appropriate wavelength values.)
382
PRACTICAL EXPERIENCES
astrophysics
Chapter 21
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For more detailed instructions and advice,
use the in2 Physics @ HSC Activity Manual.
383
21
Chapter summary
Review questions
Physically speaking
Copy and complete the following table with the knowledge that you have gained after studying this chapter.
Concept
Telescope
Angular resolution
Sensitivity
Active optics
Adaptive optics
Interferometry
384
Definition of concept
astrophysics
Reviewing
1 Recall how the introduction of technology has influenced the discoveries
made in astronomy, with particular reference to Galileo and the telescope.
4 Define the following terms and describe how they relate to a telescope:
collecting area
sensitivity
magnification
Solving problems
13 Compare the sensitivity of a telescope with a diameter of 10cm with one
of 20cm.
15 What is the size (in light-years) of the smallest features in M31 that can
Re
iew
Q uesti o
385
21
PHYSICS FOCUS
Beautiful death-star could
threaten Earth
The Earth may be in the firing line when one of the
skys most beautiful objects explodes, according to
University of Sydney astronomer Peter Tuthill.
Dr Tuthill discovered the elegant rotating pinwheel
system, named WR104, in the constellation
Sagittarius. It includes a highly unstable star known
as a Wolf-Rayet, widely regarded by astronomers as a
ticking bombthe last stop in a stars life before
a cataclysmic supernova explosion.
Figure 21.7.2 Spiral pattern around the massive star WR104, as seen using masked aperture interferometry
386
astrophysics
Extension
4 Peter Tuthill used a technique called masked
aperture interferometry to obtain this image. It
uses a single telescope instead of two or more.
How does interferometry work on a single
telescope?
5 How does the resolution of this technique compare
with that possible using an interferometer such as
the Sydney University Stellar Interferometer or the
Australia Telescope Compact Array?
6 How much of a risk to Earth do you think is posed
by the star WR104?
387
22
astrometry, parallax, astronomical
unit, light-year, parsec, spectroscopy,
spectrum, Fraunhofer lines, continuous
spectrum, emission line spectrum,
absorption line spectrum, black body,
black body curve, Planck curve,
spectroscope, prism, spectrograph,
diffraction grating, spectral classes,
brightness, luminosity, Doppler effect,
photometry, magnitudes,
apparentmagnitude, absolute
magnitude, distance modulus,
colourindex, spectroscopic parallax
Sun
Earth
6 months
from now
Figure 22.1.1
388
p
d
relatively
nearby star
photo taken
6 months
from now
Measuring
the stars
Fingerprinting the stars
The invention of the telescope led to a revolution in astronomy.
The Moon and planets were found to be unique worlds, faint
stars were revealed and new mysteries such as spiral nebulae
were discovered. However, astronomers still couldnt answer the
most basic questions: How far away are the stars? What are they?
The answer to the first question came in 1838 with the first
successful measurement of the parallax of a star. The clue to the
second question was already in place by then with Fraunhofers
observation of spectral lines in light from the Sun. The
understanding of these lines as fingerprints of the stars
is the basis of modern astrophysics.
astrophysics
The distance d of the star and the parallax angle p are simply related by:
r
tan p =
d
Using the small angle approximation (sinp tanp p) since p is so small, and
measuring p in arc seconds, r in astronomical units (AU) and d in parsecs (pc)
results in an even simpler form for this equation:
p=
1
d
Worked example
Question
Proxima Centauri is the nearest star to Earth. It has a parallax p = 0.76887 arc seconds.
What is its distance in parsecs, light-years and astronomical units?
Solution
1
1
d= =
= 1.3006 pc = 4.2421 ly = 26 827 AU = 4.0133 1016 m
p 0.76887
Checkpoint 22.1
1
2
Units of
distance
PRACTICAL
EXPERIENCES
Activity 22.1
PRACTICAL
EXPERIENCES
Activity 22.2
22
Measuring
the stars
The distances in astronomy are an enormous barrier to learning about the stars and
planets, but Compte was wrong in thinking we would never know about their
composition. Astronomical spectroscopy is the study of the light from objects in
the universe to reveal their composition and physical characteristics.
The first step had been made long before when Isaac Newton
(16431727) realised that white light is composed of component
colours, revealed when the light is passed through a glass prism. In
1802, William Wollaston (17661828) looked more closely and
noticed dark lines in the rainbow of colours that form the spectrum
of the Sun. In 1814 Joseph von Fraunhofer (17871826) used a
spectroscope to rediscover the lines and eventually catalogued over 570
of these Fraunhofer lines (Figure 22.2.1). By 1860, Gustav Kirchhoff
(18241887) and Robert Bunsen (18111899) and others discovered
that chemical elements produced bright spectral lines when heated and
They
some of these lines matched the dark Fraunhofer lines.
deduced that the Fraunhofer lines were produced by absorption of light
Figure 22.2.1 Fraunhofers spectral lines superimposed
by these elements in the Sun, providing the key to understanding the
on the rainbow spectrum of the Sun
composition of the stars.
Kirchhoff s empirical laws of spectrum analysis describe how to produce
three of the key types of spectra we observe in astronomy:
Account for the production of
A hot, dense gas produces a continuous spectrum.
emission and absorption
A hot, low-density gas produces an emission line spectrum.
spectra and compare these
A continuous spectrum source viewed through a cool, low-density gas produces
with a continuous black body
spectrum.
an absorption line spectrum.
Describe the technology
needed to measure
astronomical spectra.
PRACTICAL
EXPERIENCES
Activity 22.3
400
500
600
Wavelength (nm)
Figure 22.2.2
A continuous spectrum
Continuous spectra
When you heat an object such as a steel bar it begins to glow red and orange and
then perhaps white before melting. A very low resolution spectrograph reveals that
at any of these temperatures the bar is emitting a continuous range of
wavelengthsa continuous spectrum (Figure 22.2.2).
Any relatively dense materiala solid, liquid or high density
gaswill behave in the same way. They approximate a black body (see
section 9.2), which is the idealised example of a hot object. A black
body emits light at all wavelengths, with a distribution described by a
700
black body curve (or a Planck curve) that depends only on
temperature (Figure 22.2.6), not composition. In astronomy, the
interior of a star is very close to a black body source.
The wavelength of peak emission of a black body curve max is given by
Wiens law (see in2 Physics @ Preliminary section 14.2):
2.9 103
max =
T
where T is the temperature on the kelvin scale. The change in max with
temperature alters the balance of the visible colours and leads to a change
in the overall colour of the object. Cool objects (T 4000 K) look red but emit
most of their light in the infra-red. Peak emission from a hot object (T 10000 K)
is in the ultraviolet (UV) and the object looks blue-white to the eye.
The total energy emitted per second per unit area of the surface is
represented by the area under the black body curve. This power increases very
rapidly with temperature according to the StefanBoltzmann law:
P = 5.67 108 T 4
390
astrophysics
PHYSICS FEATURE
light from
telescope
Spectroscopes
and spectrographs
slit
grating
focusing
mirror
camera
Figure 22.2.3
A basic spectrograph
b
Intensity
collimating
mirror
measure intensity
along a slice
through the spectrum
H
H
H
500
600
700
Wavelength (nm)
Figure 22.2.4
391
22
Measuring
the stars
Worked examples
PRACTICAL
EXPERIENCES
Activity 22.4
Question
In Figure 21.0.2, the two bright stars on the left are and Centaurithe pointers to the
Southern Cross. a Centauri is actually a double star, and the brighter star has a surface
temperature of ~5800 K. b Centauri is a triple system, with the brightest star having a
surface temperature of ~23000 K. Which star is which in the image?
Solution
Intensity
12 000 K
7500 K
6000 K
4500 K
3000 K
500
1000
1500
2000
Wavelength (nm)
Figure 22.2.6
Centauri
2.9 10 3
max =
23 000
130 nm
This is in the ultraviolet, so we expect a blue-white overall colour.
Clearly the left star in the image is Centauri and the right star is Centauri.
Question
What is the relative power output of the two stars?
Solution
Using the StefanBoltzmann law for the two stars:
Centauri
P = (5.67 108 Wm2 K 4 )(5800)4
= 6.4 107 Wm2
Centauri
P = (5.67 108 Wm2 K 4 )(23 000)4
= 1.6 1010 Wm2
So, each square metre of the surface of the hotter star Centauri emits nearly 250 times
more energy per second than the surface of Centauri. The hotter star is also about 8 times
larger and so has 64 times the surface area of the cooler star. Thus the luminosity of the
hotter star is about 16000 times the intensity of the cooler one.
587.5
667.8
Wavelength (nm)
Figure 22.2.7
392
An emission spectrum
When you inject energy into a low-density gas, the spectrum is very
different. It shows bright emission lines at wavelengths characteristic of the
elements present in the gas (Figure 22.2.7). Familiar examples include the spectra
from gas discharge lamps and neon signs. Low-density gas clouds in interstellar
space also glow with an emission line spectrum.
astrophysics
The origin of emission lines lies in the structure of the atom (see Chapter
12 From Rutherford to Bohr). When an atom absorbs energy, an electron will
make a transition to a higher energy state, provided the absorbed energy exactly
matches the energy of the transition. It will often then quickly make one or more
downward transitions back to lower energy states, emitting energy.
This process is described as the absorption or emission of photons. In making
a downward transition from energy E2 to energy E1, an atom emits a photon with
energy E related to its wavelength (or frequency f) by the relation:
E = E 2 E1 =
h
= hf
c
continuous spectrum
Where does
black body light
come from?
Wavelength (nm)
Figure 22.2.8
An absorption spectrum
absorption spectrum
diffuse gas
(e.g. outer layers of
a star or a nebula)
emission spectrum
Figure 22.2.9
All three types of spectra are produced by the Sun: continuous spectrum from
deeper layers, absorption lines from the photosphere and emission lines from the
chromosphere and corona.
393
22
Measuring
the stars
Spectrum
Emission
nebulae
Dominated by strong
emission lines
characteristic of the gas
composition
Example
1
0.8
Relative flux
Object
0.6
0.4
0.2
450
500
550
600
650
(nm)
Collections of billions of
stars, gas and dust; light
output is generally
dominated by the mix of
stars of various types
0.3
Relative flux
Normal
galaxies
0.2
0.1
400
500
600
700
800
(nm)
A continuous spectrum
with emission lines (that
may be variable) suggesting
fast-moving gas clouds
0.8
0.6
Relative flux
Quasars
H + [OIII]
0.4
0.2
400
500
600
700
800
(nm)
Checkpoint 22.2
1
2
3
astrophysics
Spectral classes
The Harvard classification scheme developed during the project remains in use
today. It started as groups of spectra with similar spectral lines, given letters
from A to N, with O, P and Q added for some unusual stars. Re-ordering and
simplification of the sequence led to the current, seemingly random sequence
The major spectral classes are:
of letters.
Brightness
and Luminosity
OBAFGKM
(We can remember this odd sequence using various mnemonics, the best
known being Oh Be A Fine Girl [Guy] Kiss Me). With additions of other classes,
the sequence can be represented as:
WOBAFGKMLT
The black body curves underlying the shape of the spectra clearly imply
a sequence in surface temperature from hot O stars to cool M stars. It was not
until the 1920s that the understanding of electron energy levels in atoms
progressed enough to understand how surface temperature also controlled the
appearance of the spectral lines. Composition differences between stars are
relatively small and are not the reason why spectra vary so much.
A sample of spectra from various spectral classes is shown in Figure 22.3.1.
Some of the key features of each spectral class are described in Table 22.3.1,
emphasising that the colour of a star, its spectral class and surface temperature are
all closely related.
Each spectral class has been divided into 10 sub-classes. The Sun, for example,
is a yellow G2 star with a surface temperature of ~5770 K, placing it at the hot end
of spectral class G.
Luminosity classes
When plotting a HertzsprungRussell (HR) diagram (see in2 Physics @ Preliminary
section 15.3), we can use the spectral class on the horizontal axis, since it is closely
related to the surface temperature (Figure 22.3.2).
Effective temperature
(K)
>50 000
Colour
Blue
Strength of
hydrogen lines
Weak
Other spectral
features
He, C, N emission
lines
+
% of main sequence
stars
Extremely rare
3100050000
Blue
Weak
Ionised He lines,
strong uv continuum
0.00003
1000031000
Blue-white
Medium
Neutral He lines
0.1
750010000
White
Strong
0.6
60007500
White-yellow
Medium
53006000
Yellow
Weak
38005300
Orange
Very weak
Ca , Fe, strong
molecules, CH, CN
12
21003800
Red
Very weak
76
12002100
Red
Negligible
Neutral metals,
metal hydrides
Brown dwarf
numbers uncertain
<1200
Red
Negligible
Methane bands
Brown dwarf
numbers uncertain
395
22
Measuring
the stars
1200
1000
Ia
Ib
II
III
IV
V
VI
VII
H
H
800
05
09
B6
A2
A6
F0
F7
G2
Na
400
G7
F0
K5
200
M2
Ca II
M4
TiO
0
350
400
450
Figure 22.3.1
TiO
TiO
500
550
Wavelength (nm)
TiO
600
6 000
4 000
10
10
Supergiants (I)
10
0
Ma
10
4
6
Subgiants (IV)
equ
enc
Sub e (
dw V)
arf
(VI
)
10
1
1
10
8
10
10
White
dwarfs
(VII)
12
10
14
10
O5 BO
0.5
396
AO
FO
GO
Spectral class
KO
0.0
+0.3
+0.6
Colour index
+0.8
MO
+1.4 +2.0
10
650
30 000 10 000
Bright supergiant
Supergiant
Bright giant
Giant
Subgiant
Main sequence dwarf
Subdwarf
White dwarf
600
Relative flux
He
Description
c
The wavelength shifts lead to broadening of spectral
lines that encode information about the temperature
and motion of the gas.
Figure 22.3.2
astrophysics
Worked example
city
velo
t)
rse f sigh
sve
o
tran s line
ros
(ac
veloc
ity th
ro
space ugh
Since the first confirmed detection in 1992, more than 300 planets have been detected
around other stars. Most have been found by looking for a shift in the stars spectral lines
due to the motion of the star caused by the orbiting planets.
Question
1
Jupiters motion causes the Sun to wobble around a small circle at 12 m s . What shift in
wavelength does this represent at an observing wavelength of 500 nm?
Solution
12 ms 1
=
500 109 m 3.00 108 ms 1
line
ght
i
of s
star
city t)
elo
al v f sigh
i
d
o
ra
line
ng
(alo
Earth
Figure 22.3.3
Checkpoint 22.3
1
2
3
4
397
22
Measuring
the stars
PRACTICAL
EXPERIENCES
Activity 22.6
or equivalently:
I
mB mA = 2.50 log10 A
IB
IA
m m
= 100( B A )
IB
Apparent magnitude
The apparent magnitude is a measure of how bright an object is in our
sky. The exact zero-point of the system must be defined carefully, but the Sun,
Moon, some planets and even some stars are clearly brighter than zero
magnitude. Their values are therefore negative. Many stars are invisible to the
unaided eye and therefore have magnitudes much greater than 6 (Figure 22.4.1).
Worked examples
Question
Show that a brightness ratio of 100 corresponds to a difference of five magnitudes.
Solution
30
25
Sun
20
15
full moon
10
5
0
5
Venus at
brightest
Sirius
Polaris
naked eye limit
10
Figure 22.4.1
398
Question
The two stars of the Centauri binary system have apparent magnitudes of 0.01 and 1.34.
a Centauri is believed to be a triple system, with the star Proxima Centauri much fainter
at apparent magnitude 11.09 and some distance away on the sky. What is the relative
brightness of these three stars?
Solution
Choose mA = 0.01, mB = 1.34 and mC = 11.09
IA
= 1001.33 5 3.4
IB
20
30
15
25
IA
=100
IB
faintest
objects seen
by HST
IA
= 10011.08 5 27 000
IC
1
1
:
.
3.4 27 000
astrophysics
Absolute magnitude
Apparent magnitude doesnt tell the whole story, since it does not indicate the
The magnitude system handles this by calculating
true luminosity of a star.
how bright the stars would appear to be in our sky if all were moved to a standard
distance of 10 pc (32.6 ly). To calculate this absolute magnitude we use the
inverse square law (see in2 Physics @ Preliminary section 15.1), which can be
2
expressed as:
I A 10
=
I 10 d A
I
where A is the ratio of the brightness (intensity) at its actual distance dA
I 10
(in parsecs) compared to its brightness at 10 parsecs. Using this expression in the
equation defining magnitudes leads to:
I
m10 mA = 2.50 log10 A
I 10
10
= 5 log10
dA
d
M = m 5 log10
10
Worked examples
Question
Assuming the apparent magnitude of the Sun is 26.72, what is its absolute magnitude?
Solution
1
The distance of the Sun is 1 AU = 206 265 pc
d
M = m 5 log10
10
1
= 31.57
26.72 M = 5 log10
2 062 650
Therefore M = +4.85.
If the Sun were placed at a distance of 10 pc, it would appear as one of many relatively
faint stars visible with the unaided eye.
399
22
Measuring
the stars
Question
The brightest star in the sky is Sirius ( Canis Majoris) with m = 1.43 and M = +1.47.
How far away is Sirius?
Solution
IA
m m 5
= 100( B A )
IB
d
1.43 1.47 = 5 log10
10
2.90
d
= log10
10
5
d
102.90 5 =
10
Thus d = 10 0.263 = 2.63 pc.
This makes Sirius one of the closest stars to Earth. Although it is more luminous than
the Sun, it is the brightest star in our sky mainly because it is so close.
Checkpoint 22.4
1
2
3
PRACTICAL
EXPERIENCES
Activity 22.5
Figure 22.5.1
400
Colour index
The difference in brightness seen through different filters is a measure of the
colour of a star. A blue-white star will appear brighter (with a lower magnitude
value) through a filter that passes blue light than through a filter that passes only
astrophysics
red light. Using these magnitude measurements, the concept of colour can be
A colour index is the difference in the brightness of a
assigned a value.
star, in magnitudes, when measured through two different filters.
Many colour indices are possible, but the best known is the (B V) colour
index (or mB mV), the difference between the blue and visual apparent
magnitudes. The system is set up so that white A0 stars have (B V) = 0.
The colour index of the Sun is +0.66. Blue-white stars have small negative
colour index values, while orange-red stars have larger positive colour indices
(Figure 22.5.2).
Worked example
Question
Assuming the colour index of the Sun is +0.66, what is
the apparent B magnitude (or mB) of the Sun? What is
the absolute B magnitude (MB) of the Sun?
Crucis
V = +1.59
B V = +1.60
a Centauri A
V = 0.01
B V = +0.71
d Crucis
V = +2.79
B V = 0.19
Solution
From an earlier worked example, V = mV = 26.76 and
MV = +4.85.
Colour index = B V
0.66 = B (26.76)
Therefore B = 26.10
b Crucis
V = +1.25
B V = 0.24
b Centauri
V = +0.61
B V = 0.23
Similarly:
a Crucis
V = +0.77
B V = 0.24
e Crucis
V = +3.59
B V = +1.39
Colour index = MB MV
0.66 = MB 4.85
Therefore MB = 5.51
The fact that the blue magnitudes are greater than the visual magnitudes
implies the Sun is fainter in the blue, as expected for a yellow star.
Figure 22.5.2
Spectroscopic parallax
Measuring the colour index of a star places it within a narrow strip on
the HR diagram. Studying details of the spectral lines identifies the stars
luminosity class within that strip. With these two measurements we have
approximately located it on the diagram, without knowing its absolute
magnitude (see Figure 22.5.3). However, reading the vertical scale now reveals
absolute magnitude. Comparison of the absolute magnitude and the apparent
magnitude will then yield a distance, using the equation given earlier.
This method of determining distance is called spectroscopic parallax.
This is a poor name since parallax is not involved. This procedure is essentially
identifying the type of star from the characteristics of its light. Then, knowing
how bright such stars are, a distance can be calculated using the inverse
square law.
401
22
Measuring
the stars
7 000
6 000
4 000
10
10
Crucis
Crucis
4
Crucis
2 Centauri
Crucis
0
Giants (II, III) Crucis
Ma
in
seq
2
u
Subgiants (IV)
Sub ence
(V)
dw
Centauri A
arf
4
(VI
)
Sun
6
Centauri B
8
10
10
10
1
10
10
White
dwarfs
(VII)
12
10
10
Supergiants (I)
10
14
10
O5 BO
0.5
AO
FO
GO
Spectral class
KO
0.0
+0.3
+0.6
Colour index
+0.8
MO
Figure 22.5.3
+1.4 +2.0
Worked example
Question
One of the stars identified in Figure 22.5.2 is Crucis, the brighter reddish star in the
Southern Cross. Estimate its distance using spectroscopic parallax, given that it is a
M3.5 III star with V = 1.59 and colour index +1.60.
Solution
From the HR diagram in Figure 22.5.3, the absolute magnitude of Crucis is just above zero,
say 0.5.
Then:
d
m M = 5 log10
10
d
1.59 ( 0.5) = 5 log10
10
2.09
d
= log10
10
5
d
102.09 5 =
10
Checkpoint 22.5
1
2
3
402
PRACTICAL EXPERIENCES
astrophysics
CHAPTER 22
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Perform a first-hand
investigation to examine a
variety of spectra produced by
discharge tubes, reflected
sunlight, or incandescent
filaments.
403
22
Measuring
the stars
PRACTICAL EXPERIENCES
Activity 22.4: Temperature prediction
Examine each of the given black body radiation curves of stars and determine their
surface temperature.
Discussion questions
1 State the relationship between peak wavelength and surface temperature
of a star.
2 Outline the method that you followed in order to determine the
temperature of the star.
Observe a light through different filters to determine how brightness changes when
only selected colours are viewed.
Equipment: light meter, 2 red filters, 2 blue filters, light source.
Discussion questions
1 Compare the brightness of a red star when viewed through a red filter with
that of the star when viewed through a blue filter.
2 Compare this with viewing a blue star through the filters.
3 Explain how the change in brightness when viewed through the filters
would affect the measured magnitude of each star.
Research some of the instruments and techniques used in astronomy that have
aided in developing our understanding of celestial objects.
Telescopes
Photomultipliers
CCDs
Optical fibres
Thin telescope mirrors and active optics
Adaptive optics
Optical interferometry
IR array detectors
Radio telescopes
Radio interferometry
X-ray mirrors
Scintillators
Balloon-borne telescopes
Space telescopes
Discussion questions
1 Outline what has been discovered or developed because of the use of the
item you have researched.
2 Discuss what technological developments had to occur in order for this
piece of equipment to be used.
404
Chapter summary
astrophysics
c
Photometry is the measurement of the brightness of an
astronomical source.
Each stellar magnitude represents a factor of about 2.5
in brightness. A difference of five magnitudes is
equivalent to exactly 100 times in brightness.
Apparent magnitude is a measure of how bright an
object appears in our sky.
Absolute magnitude describes how bright a star would
appear in our sky if moved to a standard distance of
10 pc (32.6 ly).
The overall colour impression from a star is a result of
both the spectrum from the star and the spectral
sensitivity of the detector.
A colour index is the difference in the brightness of
a star in magnitudes, when measured through two
different filters. The best known is the B V colour
index.
Spectroscopic parallax is a distance measurement made
by identifying the type of star from the characteristics of
its light and then comparing its apparent and absolute
magnitudes to derive a distance.
Review questions
Physically Speaking
On the right is a list of topics that have been
discussed throughout this chapter. Create a
visual summary of the concepts in this chapter
by constructing a mind map linking the terms.
Add diagrams where useful.
Parallax
Colour index
Distance
Colour
Spectroscopy
Temperature
Photometry
Absorption
spectra
Emission
spectra
Magnitude
Luminosity
Continuous
spectra
405
22
Measuring
the stars
Reviewing
Solving problems
(1000 pc).
440
420
400
Flux
380
360
340
320
300
350
400
450
500
550
600
650
Wavelength (nm)
Figure 22.6.1
Re
iew
406
Q uesti o
Stellar
companions
and variables
23
r1 =
m2
r
1 + m2
23
Stellar
companions
and variables
orbit of
star 2
star1
m1
r1
focus 2
focus 2
centre
of mass
r2
orbit of
star 1
star2
m2
Figure 23.1.1
Try this!
Worked example
Question
m1
m2
r1
Figure 23.1.2
r2
Solution
mSun = 1.99 1030 kg
mJupiter = 1.90 1027 kg 0.001 mSun
Use the equation:
rSun =
mJupiter
m Sun + mJupiter
The apparent size and shape of the orbit is interesting, but the real
importance of binary stars is the possibility of determining their masses. Mass
cannot be determined from spectroscopic information alone. We need to study
the gravitational effect on another object, and a binary star system provides a
laboratory in which to do this.
We can see how this works if we assume the orbit is circular and use equations
from section 2.2. The centripetal force needed to maintain circular motion of
star 1 around the centre of mass is provided by gravitational force between the
two stars:
Fgravity = Fcentripetal
Gm1m2
408
r2
m v2
= 11
r1
astrophysics
2 r1
We can include the period T of the orbit of star 1 by using v1 =
.
T
This gives:
2
Gm2 4 r1
=
r2
T2
mm22
m + m rr
(m1 1+ m22)
m1 + m2 =
or
4 2r 3
GT 2
r 3 G (m1 + m2 )
=
T2
4 2
This is a form of Keplers third law (section 2.2; see also in2 Physics @
Preliminary section 13.4) but involves the combined mass of both stars. It also
requires r, the true distance between the stars, with allowance made for any tilt
of the orbit relative to the plane of the sky. This means that the distance to the
system must be known.
If r1, the distance from star 1 to the centre of mass can also be determined,
then the masses m1 and m2 can be calculated individually by using a slight
rearrangement of the earlier result for r1. The problem is that only rarely do we
have enough information to do this.
Worked examples
Question
The Centauri system is 1.338 pc away. The A and B components orbit each other with
a period of 79.92 years and an average distance of 23.7 AU. What is the total mass of
the system?
Solution
Use
m1 + m2 =
m1 + m2 =
4 2r 3
GT 2
4 2 3.55 1012 m
binary status
In binary or multiple star
systems we generally call the
brightest star the primary and
assign it the letter A. The
secondary star is B and any
other stars in the system are C,
D etc. Unlike the system in
Figure 23.1.1, a binary system
is often drawn centred on the
primary, which is considered
stationary. For example, in
Figure 23.1.3 the system is
centred on Centauri A, with
the very close companion
Centauri B in orbit around it.
Notice that the orbit is
elliptical, although not the
same ellipse you would draw if
the motions were represented
relative to the centre of mass
(as in Figure 23.1.1). However,
even this elliptical orbit is only
the apparent orbit on the
plane of the sky. The real
elliptical orbit is tilted relative
to the plane of the sky.
(6.672 10 )(2.522 10 s)
11
Try this!
= 4.16 1030 kg
2.1 solar masses
This is usually written with a special symbol for solar mass as 2.1M.
In fact the orbit is highly elliptical, but the result is still valid.
Question
Measurements indicate that Centauri A has an average distance of 11.2 AU from the
systems centre of mass. What is the mass of each star?
409
23
Stellar
companions
and variables
Solution
Use
m2
1.68 1012 m
=
3.55 1012 m 4.16 1030 kg
r1 m2
=
r (m1 + m2 )
PHYSICS FEATURE
Centauri
Orbit
Period: 357.00 0.07 days
Semi-major axis: 25.30 0.19 milli-arc seconds
Eccentricity: 0.821 0.003
Inclination to the plane of the sky: 67.4 0.3 degrees
Primary
Absolute V magnitude:
Secondary
M1 = 3.85 0.05
Absolute V magnitude:
M2 = 3.70 0:05
20
15
orbit of Centauri B
10
5
0
5
10
Centauri A
10
15
20
25
Checkpoint 23.1
1
2
410
Draw a diagram of a binary system that shows the two stars, the centre of mass of the system and their elliptical
orbits around the centre of mass.
Determine where the centre of mass is located between Jupiter and its moon Callisto.
MJupiter = 1.901027kg
MCallisto = 1.08 1023kg
Average distance of Callisto from Jupiter = 1883000km
astrophysics
Visual binaries
A visual binary (or multiple system) can be resolved into two (or more)
stars by a telescope under sufficiently good seeing conditions. Like Centauri
(Figure 23.2.1), many of the brighter stars are doubles well known to amateur
astronomers. A few are merely chance alignments of unrelated stars, but most are
genuine binaries.
A simple calculation of angles will reveal that if a star is to be seen as a double
in a telescope with angular resolution about 1 arc second, the stars must be at
least tens of AU apart and the system must be relatively close. Such stars orbit
each other slowly; but over many years the orbital motion may become apparent.
More than 100000 visual double stars are catalogued, but the orbits of only a
few thousand are known.
B
a
1980
Astrometric binaries
Some stars are sufficiently close that their motion through space is apparent as
A few of these stars reveal
motion across the sky, that is, as proper motion.
their binary nature by the wobbling of their paths across the sky (Figure 23.2.2).
The centre of mass of such an astrometric binary follows a straight path, but
the individual stars appear to wobble as they orbit.
Few binaries are discovered in this way, since it usually requires long-term
observations of nearby stars. The Hipparcos astrometry mission (section 22.1)
revealed many new examples, because its very high precision position
measurements revealed the motions of stars much more quickly.
Spectroscope binaries
1970
1960
Sirius B
1950
1940
Sirius A
1930
1920
1910
Most binary systems are too distant to appear as either visual or astrometric
1900
centre of mass
However, the spectrum of an unresolved binary will have light
binaries.
contributed by both stars. As the stars orbit, one will typically have a component
Figure 23.2.2 Sirius is a visual and an
of its motion towards us, while the other is moving away. The light from the two
astrometric binary.
stars will show small blue and red Doppler
A is moving towards the
shifts that change as the stars orbit. Provided
observer: lines blue-shifted
A
B
the stars are close in brightness, the result is
B is moving away from the
a doubled-lined spectroscope binary in
A BA B
A B
observer: lines red-shifted
which each spectral line will appear doubled
by these small shifts in wavelength (Figure
B
A and B are moving across
23.2.3). At other times, the motions of the
the observers line of sight:
stars may be entirely across the line of sight
no Doppler shift
A
A&B
A&B
A&B
and show no Doppler shift and therefore
display only a single set of lines.
A is moving away from the
If the stars are significantly different in
observer: lines red-shifted
B
A
brightness, the spectrum of the brighter star
B is moving towards the
will flood the fainter one and only one set
observer: lines blue-shifted
B AB A
B A
of moving spectral lines may be visible,
Figure 23.2.3 The changing pattern of spectral lines in a spectroscopic binary
forming a single-lined spectroscopic binary.
(very exaggerated)
411
23
Stellar
companions
and variables
To be visible as a spectroscopic binary, the component of the orbital velocity
measured by the Doppler shift must be relatively large. This means that the stars
must be closemuch closer than most visual binaries, so they orbit quickly. Also,
the orbit must be orientated so that the orbital motion has a component in the
line of sight. This is not enough to really understand the system since, generally,
the tilt of the orbit to the line of sight is unknown.
The brighter star of Centauri is a singled-lined spectroscopic binary that has
now been resolved by the SUSI interferometer (see Physics Feature p 410). This
combination of spectroscopic and astrometric data is rare and valuable, since it
allows all the parameters of the system to be derived, in particular the mass.
Eclipsing binaries
PRACTICAL
EXPERIENCES
Activity 23.1
Visual magnitude
orbit of
secondary
1.6
1.8
2.0
2.2
2.4
2.6
2.8
3.0
3.2
3.4
3.6
primary
secondary
primary
eclipse
Figure 23.2.4
Figure 23.2.5
412
secondary
eclipse
2.867321 days
If the stars of a binary system are close together and orientated so that the
orbital plane is close to edge-on, then it may be seen as an eclipsing binary.
In these systems the stars regularly eclipse one another, periodically blocking out
some of the light from the system. To be well enough aligned with our line of
sight means that the stars are usually very close together, with orbital periods of
just a few hours or days. They are therefore usually also spectroscopic binaries.
They may be so close together that the stars are distorted by their mutual
gravitational forces.
The change in brightness is apparent in the light
curve of the systemsa plot of the apparent magnitude
of the system versus time (Figure 23.2.4) that repeats
with every orbit. If the stars are different, the two
eclipses will not be identical. A primary eclipse will
primary
result in a greater loss of light than the secondary
eclipse
eclipse, with the details determined by the tilt of the
orbit, the relative size of the stars, their surface
temperatures and even the structure of their
atmospheres.
The importance of eclipsing binaries lies in the
wealth of information we can glean from observations,
including masses and distances.
astrophysics
10 000
1000
100
10
1
0.1
L M 3.5
0.01
0.001
0.1
10
100
Mass (M/M )
Figure 23.2.6
Checkpoint 23.2
1
2
Recall the different types of binary systems and outline the characteristics of each type.
Explain the significance of the massluminosity relationship.
GROUP
TYPE
CLASS
Cepheids
PULSATING
STARS
INTRINSIC
VARIABLE
Type II
W Virginis
RV Tauri
Long-period
variables
ERUPTIVE
(cataclysmic)
STARS
VARIABLE
STARS
RR Lyrae
Type I
Classical
Mira type
Semiregular
Supernovae
Novae
Dwarf novae
Symbiotic stars
Flare stars
R Coronae Borealis
T Tauri stars
ECLIPSING
BINARIES
EXTRINSIC
VARIABLE
ROTATING
VARIABLES
Figure 23.3.1
23
Stellar
companions
and variables
Effective temperature (K)
30 000 10 000
7 000
6 000
4 000
10
10
Type I
4
Absolute magnitude (Mv)
RV Tauri
Semiregular
stars
Classical Cepheids
Long-period
variables
Type II
RR Lyrae
Ma
W Virginis
10
10
equ
enc
T Tauri stars
Instability
strip
10
Miras
in s
10
10
10
10
12
Flare stars
14
10
10
O5 B0
0.5
Figure 23.3.2
A0
F0
G0
Spectral class
K0
0.0
+0.3
+0.6
Colour index
+0.8
M0
+1.4 +2.0
Non-periodic variables
The non-periodic variables cover a wide range of
different types of stars that are physically very different,
as illustrated by the systems described in Table 23.3.1.
Brightness change
Physical description
Dwarf novae
Symbiotic stars
R Coronae Borealis
T Tauri
414
astrophysics
Periodic variables
Periodic variables show regular or semi-regular changes in brightness,
with periods ranging from hours to hundreds of days. The major types are
shown in Table 23.3.2. Their brightness changes as the stars pulsate in size,
surface temperature and colour.
The pulsation arises from a slight instability in the balance between the
inward pull of gravity and the outward pressure of the gas and radiation
(see in2 Physics @ Preliminary section 15.4). This instability only occurs in the
outer layers of the stars and does not affect the energy production in the core.
Conditions in pulsating stars are just right to allow the pulsations to continue
instead of dying away, as they would in most stars. These conditions are found
in stars in the instability strip on the HR diagram, with another similar zone
of long-period variables among the red giants (Figure 23.3.2).
Table 23.3.2 Major classes of pulsating variables
Variable type
Brightness change
Physical description
Cepheid
Luminous yellow
supergiants
Old giants stars with
MV +0.6
Yellow supergiants
RR Lyrae
RV Tauri
Mira
Semi-regular
Naming of stars
Checkpoint 23.3
1
2
415
23
Stellar
companions
and variables
Brightness
PERIOD
Time
Figure 23.4.1
Type I (Classical)
Cepheids
104
Luminosity (L )
Type II (W Virginis)
Cepheids
103
102
RR Lyrae
1
0.5
Figure 23.4.2
1
10
Period (days)
416
30 50
100
astrophysics
Worked examples
Question
(zeta) Geminorum is a Type I Cepheid variable star in the constellation of Gemini. It varies
in brightness between 3.7 and 4.2 magnitudes every 10.2 days. Using the periodluminosity
relation, estimate its absolute magnitude.
Solution
Using the arrows on Figure 23.4.2, a period of 10.2 days indicates
a luminosity of approximately 3500L.
This can be converted to a magnitude difference using:
I
mB m A = 2.50 log10 A
IB
but working with absolute magnitudes. MV for the Sun is +4.85, so:
4.85 MA = 2.50log10 (3500)
and thus MA is 4.0.
Question
Using Geminorums average apparent magnitude, estimate its distance.
Solution
If we assume the average apparent magnitude is +3.95 then, using the relation between
absolute and apparent magnitude:
d
3.95 ( 4.0) = 5 log10
10
d
7.95 5
Then = 10
and d 390 pc
10
This is close to the accepted value.
Checkpoint 23.4
1
2
3
417
23
Stellar
companions
and variables
PRACTICAL EXPERIENCES
CHAPTER 23
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
Perform an investigation to
model the light curves of
eclipsing binaries using
computer simulation.
Chapter summary
418
Review questions
astrophysics
Physically Speaking
Complete the passage below by filling in the missing words from this list:
brightness, companion, eclipse,
eclipsing binaries, eruption,
extrinsic, intrinsic, magnitudes,
periods, pulsating variables,
pulsation, rotation, variability
Variable stars are stars that change ______________. This change can range from
0.001 to as much as 20 ______________ over ______________ of a fraction of a
second to years. There are a number of reasons why the brightness of variable
stars will change. ______________ swell and shrink due to internal forces, while a
star in an ______________ will dim when it is eclipsed by a faint ______________ and
then brighten when the occulting star moves out of the way.
Variable stars are classified as either ______________, when variability is caused
by physical changes such as ______________ or ______________ in the star or stellar
system, or ______________ when ______________ is caused by the ______________ of
one star by another or by the effects of stellar ______________.
Reviewing
1 Define what is meant by the term binary star.
2 Discuss why binary star systems are important in astronomy.
3 Draw a typical light curve that would result from an eclipsing binary
system that consists of two identical stars.
419
23
Stellar
companions
and variables
Solving problems
13 If a binary system consists of a 0.5M red dwarf star and a 3M red giant
star separated by 20 AU, calculate the location of the centre of mass.
4 r
GT 2
m1 + m2 =
2 3
14 Sketch the orbits around the centre of mass of the 0.5M and 3M stars
in the previous question.
15
Apparent magnitude
4.2
4.4
4.6
4.8
5.0
0.3
0.0
0.3
0.6
0.9
1.2
1.5
1.8
2.1
2.4
Time (days)
Figure 23.5.1
Re
iew
420
Q uesti o
astrophysics
PHYSICS FOCUS
Variables and supergiants
by Jonathon Nally
For most of the 20th century, professionals and
amateurs were separated by a wide gap in technology.
Only the professionals could afford the large
telescopes and complex support gear required for
modern astronomy and astrophysics. That gap has now
closed to some degree. Amateurs now have affordable
access to sophisticated telescopes, electronic
detectors, software, and all the other paraphernalia
needed to contribute to many areas of astronomy.
But its great to see that theres still a place for the
good old human eye. And this is nowhere more evident
than in the field of variable star research.
Some variable stars change brightness over a period
of days, weeks, months or even years. Many change
their brightness with perfect reliability, while others are
totally unpredictable. This means that constant, longterm monitoring is needed, which is the sort of thing
that professional astronomers really are unable to do
but its a job perfectly suited to amateurs.
Every night, all over the world, scores of dedicated
variable star observers are outside, making
observations and brightness readings of hundreds of
variable stars. They tabulate their data and send them
in to one of several umbrella organisations scattered
around the world. The data is then made available to
the professionals who, it cannot be stressed enough,
rely so heavily on this work by the amateurs.
Usually, it takes many years of patient monitoring
for enough data to be built up to make a difference in
research programs. But sometimes things can happen
much, much faster.
On the night of 6 January 2002, Aussie amateur
astronomer Nick Brown spotted something unusual.
A certain star was much brighter than it should have
beenV838 Monocerotis had gone into outburst.
It was a nova, and Nick was the first person to spot it.
He quickly reported it, and it came to the attention
of a bunch of professional astronomers who were
interested in just this particular kind of star. For the
past year, this team has been studying V838, and
20 May 2002
2 September 2002
28 October 2002
17 December 2002
Figure 23.5.2
Extension
3 V838 Monocerotis is now well known because of
images taken by the Hubble Space Telescope
(Figure 23.5.2) showing a light echo from the
outburst. Define the term light echo.
4 Recount what has happened to V838 Monocerotis
since the time of the radio broadcast in 2003.
5 Propose some other fields of astronomy in which
you think amateur astronomers can make valuable
contributions.
421
24
Birth, life
and death
Stellar evolution: How do we know?
Astronomers talk about the evolution of a star, but not quite in the
sense a biologist uses the word. Astronomers mean that an individual
star transforms itself during its lifetime as it consumes its nuclear fuel.
However, stars have also evolved over generations of birth and death.
Stars born today are somewhat different from stars born closer to the
time of the Big Bang.
How do we know about the evolution of the stars? Its a little like
biology in that we almost never see evolution happening in our
lifetimes, but the evidence is there to be found. Some
of the ancient stars are still around to see today.
astrophysics
The gas is about 70% hydrogen and 28% helium (if measuring
mass), with traces of other elements and molecules. This is spread
through several different components of the ISM with very different
properties (Table 24.1). Of particular interest are the cold giant
molecular clouds in which the gas is most dense, and simple molecules
can form. Individual molecular clouds have masses up to millions of
solar masses. They make up only about 1% of the volume of the ISM
but contain 90% of the mass and are the sites of star formation.
Where massive stars have formed in a giant molecular cloud, their
intense ultraviolet (UV) radiation eats into the cloud and creates an
emission nebula. Elsewhere the embedded dust may be revealed by dark
nebulae and reflection nebulae.
The dust is just a fraction of a millimetre in size and represents
about 1% of the mass of the gas. Its characteristics vary with location,
but it seems to consist of silicate or carbon grains, sometimes with a
coating of various ices. At least some of it is formed in the cool outer
atmospheres of red supergiant stars and blown outwards by the stars
stellar wind.
The ISM is part of a giant recycling system that includes the
stars (Figure 24.1.3). Gas from the ISM forms stars, is processed by
nuclear reactions inside the stars, and some is returned to the ISM when
the largest stars age and die.
Table 24.1.1 Major components of the ISM, with the air around you included for comparison
Component
Temperature (K)
300
2050
50150
103104
105106
Figure 24.1.2
neutral atom
clouds
ionised clouds
molecular clouds
ISM
~10
1091011
106109
~104
2
10 103
stellar outflows
and explosions
Checkpoint 24.1
1
2
3
star formation
nuclear fusion
in stars
Figure 24.1.3
423
24
Birth, life
and death
4
3
converted to kinetic energy and heats the gas while the
9M
104 years 10
2
rest is radiated as infra-red light. As each core gets
5M
105 years
2
10
0
hotter, the gas pressure increases, slowing the collapse
3M
Ma
2
in s
106 years 10
of the core while the surrounding gas continues to fall
equ
enc
inwards. The collapsing fragment of the cloud is now a
e
4
1
protostar, probably one of many, perhaps destined to
1M
6
1
10
form part of a binary system or even an entire cluster
8
ZAMS
7
2
of stars. It is a luminous source of infra-red light
10
10
10
0.5M
years
4 000
7 000
30 000 10 000
jet
protostar
outflow
inflow
accretion disc
jet
Figure 24.2.2
424
astrophysics
Checkpoint 24.2
1
2
3
4
Mass
(m)
O5
B0
A0
F0
G0
K0
M0
40
15
3.5
1.7
1.1
0.8
0.5
Luminosity
(l)
400000
13000
80
6.4
1.4
0.46
0.08
Surface temperature
(K)
Radius
(r)
40000
28000
10000
7500
6000
5000
3500
13
4.9
3.0
1.5
1.1
0.9
0.8
1.0
11
440
3000
8000
17000
56000
To understand why the high-mass stars are so rash with their fuel, compared
to their more frugal lower mass cousins, we need to consider the two processes at
work in main sequence stars converting hydrogen into helium: the proton
proton chain and the carbonnitrogenoxygen cycle (see in2Physics
In both cases the net reaction is a combination
@Preliminary section 15.4).
of four hydrogen nuclei (protons) into one helium nucleus with the release of
energy and some other light particles. Energy is available because the mass of the
four hydrogen nuclei is more than that of one helium nucleus. The lost mass is
converted to energy according to Einsteins famous equation E = mc2.
425
24
Birth, life
and death
1H
1
2H
1
1
1H
1
1H
3
2He
1H
1
1
1H
4He
2
3
2He
1H
1
1
1H
2H
1
proton
neutron
positron
neutrino
gamma ray
1
1H
Figure 24.3.1
12
6 C
1H
1
4
2 He
12
C re
cyc
led
12
6 C
13
7 N
1H
1
15
7 N
13
6 C
positron
neutrino
gamma ray
15
8O
1H
1
Figure 24.3.2
14
7 N
1H
1
3
2
H + 11 H 21 H + e + +
2
1
H + 11 H 23 He +
He + 23 He 42 He + 2 11H
C + 11 H 137 N +
13
7
13
6
14
7
1
1
C + 11 H 147 N +
N + 11 He 158 O +
15
8
15
7
N 136 C + e + +
O 157 N + e + +
N + 11 He 126 C + 42 He
astrophysics
ionised clouds
ISM
STAR FORMATION
High mass 8M M
Red supergiant
He fusion in core via triple- process,
H fusion in shell
Type II supernova
produces heaviest elements
Black hole M 3M
No fusion reactions
Red giant
He fusion in core via triple- process,
H fusion in shell
Planetary nebula
White dwarf
M 1.4M
No fusion reactions, slowly cooling
Figure 24.3.3
7 000
6 000
4 000
10
2 million years
initial 30M .
5
10
Supergiants (I)
4
10
30 million years
initial 10M .
4
Absolute magnitude (Mv)
Brown dwarf
No ongoing fusion
Neutron star
1.4M M 3M
reactions
No fusion
Climate prediction:
getting hotter!
3
10
2 initial 5M
initial 3M
2
10
Ma
in s
Subgiants (IV)
enc
e (V 11 billion years
)
present sun
initial 1M
equ
1
4.6 billion years
56 billion
1
10
years
ZAMS
8
10
White
dwarfs
12
initial 0.5M
(VII)
14
Figure 24.3.4
10
2
10
3
10
4
10
O5 B0
0.5
A0
F0
G0
Spectral class
K0
0.0
+0.3
+0.6
Colour index
+0.8
M0
+1.4 +2.0
Checkpoint 24.3
1
2
3
Explain why the lifetime on the main sequence is different for stars of different mass.
Write the net nuclear reaction equations that summarise the protonproton chain and CNO cycle.
If the CNO cycle produces negligible energy at 13 million K, explain why it can dominate energy production at
20 million K.
427
24
Birth, life
and death
Eventually a main sequence star runs very low on hydrogen fuel in its core. For
a star like the Sun, there is no mixing of new gas into the core and no increase in
the reaction rate can counteract the lack of fuel. The core collapse accelerates,
raising the temperature as gravitational potential energy is released, and igniting
hydrogen fusion in a shell around the helium core.
Computer modelling shows that the outer layers of the star respond by
expanding outwards and cooling. The plotted position on the HR diagram rapidly
moves to the right across the sub-giant range (Figure 24.4.1). The luminosity of the
star then begins to grow and the outer layers inflate further as the star climbs the
red giant branch. About 1.3billion years after leaving the main sequence, the G
class V main sequence Sun becomes a K or M class III giant, about 200 times its
current size.
The star now has a dense core that is perhaps one-third its original size, but the
grossly extended outer layers reach out to nearly envelope the orbit of the Earth.
These outer layers are of very low density and only weakly held. As a main sequence
star, the Sun looses mass at a rate of ~1014M per year as solar wind (see in2 Physics
@ Preliminary section 16.4). As a red giant, the mass loss will be perhaps 107 times
larger, and it will simply blow away a significant fraction of its mass.
In the core of the red giant the temperature eventually rises to about
100 million K and the fusion of helium into carbon begins via the triple alpha
process:
4
He + 42 He 48 Be +
2
8
4
Be + 42 He 126 C +
428
astrophysics
6 000
4 000
10
5
10
Supergiants (I)
planetary nebula
AGB
6
4
H He
2 initial 5M
0
He C
H He
helium
ignition
RGB
HB
helium
flash
AGB
Giants (II, III)
He C HB
H He
Ma
in s
equ Subgiants (IV)
enc
e (V
)
H He
2
4
RGB
H He
8
White
dwarfs
(VII)
12
no nuclear
reactions
2
10
10
3
10
10
initial 1M
4
10
initial 0.5M
14
1
10
30 000 10 000
2
10
3
10
4
10
O5 B0
0.5
A0
F0
G0
Spectral class
K0
0.0
+0.3
+0.6
Colour index
+0.8
Figure 24.4.1
M0
+1.4 +2.0
PHYSICS FEATURE
When is a star not a star?
Figure 24.4.2
24
Birth, life
and death
Checkpoint 24.4
1
2
3
Outline the stages in the evolution of a solar mass star from main sequence star to white dwarf.
Describe the processes by which a star loses sufficient mass to end its life as a white dwarf.
Recall the characteristics of a white dwarf.
430
For any star the basic problem is the same throughout its life. To be stable,
it must generate enough energy to ensure that internal pressure balances the
gravitational force. In a massive star, this means higher core temperatures to
generate the energy required and a correspondingly shorter lifetime.
For stars that start life with more mass than the Sun, but still less than about
8M (the middle of spectral class B), evolution is faster than for lower mass stars
(Figure 24.4.1), but they have the same fate in store. They lose more mass via
stellar winds and all end up below the 1.4M limit for the mass of a stable
white dwarf.
Stars starting life more than about 8M evolve much more quickly
Describe the types of nuclear
and have a different ending (Figure 24.3.3). Their main sequence lifetimes are
reactions involved in
measured only in millions of years (Table 24.3.1). Even at this stage, their stellar
Main Sequence and post-Main
winds lead to significant mass loss (perhaps 106M per year for a 60M star).
Sequence stars.
Their evolutionary path is flatter on the HR diagram (Figure 24.5.1), indicating
Discuss the synthesis of
elements in stars by fusion.
that their size changes dramatically but their luminosity does not. These stars
become supergiants, perhaps 1000 times the size of the Sun. They cross back
and forth across the instability strip, becoming Cepheid variables for part
of their lives.
In their cores, the onset of helium burning is gradual,
Effective temperature (K)
but as the temperature increases to more than 300 million K
30 000 10 000 7 000 6 000
4 000
10
supernova
the fusion of carbon with helium to produce oxygen
AGB
5
helium
10
He C
8
ignition
becomes the dominant reaction. When the temperature
H He
HB
RGB
4
6
climbs above 500 million K, carbon nuclei can fuse together
10
Supergiants (I)
H He
4 initial 10M
to produce sodium, neon and magnesium. All the carbon is
3
10
2
consumed quickly and the core collapses further, pushing
2
10
the temperature ever higher. Ultimately, fusion of silicon
0
Giants (II, III)
Ma
in s
produces iron at core temperatures of 7 billion K.
2
10
equ Subgiants (IV)
enc
The core of the supergiant resembles the layers of
e
4
(V)
1
an
onion
in which an iron core is surrounded by shells in
6
1
which
silicon
and sulfur, oxygen and carbon, helium and
10
8
hydrogen are all undergoing fusion reactions (Figure
2
10
10
White
24.5.2). However this is a fleeting phase. A 25M star will
dwarfs
3
12
10
(VII)
fuse hydrogen on the main sequence for about 7 million
14
4
10
years. It then spends 500000 years also consuming helium,
O5 B0
A0
F0
G0
K0
M0
600 years consuming carbon, half a year consuming oxygen
Spectral class
and just a day consuming silicon.
0.5
0.0
+0.3
+0.6
+0.8
+1.4 +2.0
Colour index
Synthesis of iron is the limit of normal fusion processes,
since
any further reaction costs the star energy rather than
Figure 24.5.1 Evolution of high-mass stars after they leave
generating it, accelerating its collapse. Within seconds a
the main sequence
astrophysics
portion of the core the size of the Earth collapses to just a few
kilometres across, achieving densities of 1017 kg m3. The speed
Fe core
core
of collapse causes the core to bypass the electron degeneracy
region
At 1012 K, degenerate
pressure that supports a white dwarf.
neutron pressure finally halts the collapse at the centre; however,
the surrounding layers hit the core and bounce back, blowing the
star apart in a supernova explosion. In a few seconds 1046 J of
energy is released, 99% of it as elusive neutrinos. Just 1% of the
H shell
supergiant star
Si, S shell
He shell
energy appears as visible light, but this is sufficient for it to
C, O shell
outshine all other stars in the galaxy for a few days!
Figure 24.5.2 The shell structure in the core of a high-mass star
During the explosion, iron nuclei are ripped apart and
just before it explodes
a large number of neutrons are released. These neutrons can be
captured by existing heavy nuclei to produce heavier nuclei than
Explain the concept of star
could be produced by any process earlier in the stars history.
death in relation to:
The blast wave and several solar masses of gas ejected by the supernova
supernovae
explosion speed outward at several thousand km s1, sweeping up gas blown away
neutron stars/pulsars
earlier by the stellar wind (Figure 24.5.3). The result is a supernova remnant
black holes.
(Figure 24.5.4), glowing across the full range of wavelengths and carrying gas
enriched in a full range of heavy elements back into the ISM. After perhaps
100000 years, the expanding remnant fades and merges into the ISM, but it may
have already triggered a new burst of star formation nearby.
Back at the site of the explosion, some of material very quickly falls back onto
Computer simulations remain very
the core of degenerate neutrons.
uncertain, but they indicate that stars with initial masses of between 8M and
roughly 25M will produce a core of less than 3M. The core will then be stable
as a neutron star. Much rarer stars with initial masses of between roughly 25M
and 40M will have more material falling back. The core mass will exceed the
3M limit of stability for a neutron star. No known force will stop the collapse
and the result is the formation of a black hole. Computer simulations suggest
that even more massive stars may form a black hole before an explosion can occur!
Figure 24.5.3
1994.4
2000.9
2005.2
2008.3
Figure 24.5.4
24
Birth, life
and death
PHYSICS FEATURE
rotation axis
magnetic axis
beamed
radiation
Black holes
astrophysics
Checkpoint 24.5
1
2
3
Figure 24.6.1
Figure 24.6.2
433
24
Birth, life
and death
Figure 24.6.3a is an HR diagram for the open cluster M44. The line on the
diagram estimates the ZAMS. Most of the stars are clearly still on the main
sequence. However, stars at the high-mass end are starting to evolve away from
the main sequence, and a few more-evolved supergiants are apparently also
present. This is consistent with our expectations that stars move off the main
sequence to the right on the HR diagram and massive stars evolve most quickly.
Figure 24.6.3b is a rather different HR diagram for the globular cluster M92.
The turn-off point is much lower down the main sequence, and a well-developed
The turn-off point is an indicator of the age of the
giant branch is apparent.
cluster; the further down the main sequence the turn-off point is, the older the
cluster. The M44 cluster is estimated to be about 600 million years old, while the
M92 cluster was formed early in the universeabout 13 billion years ago.
a
4.0
12
6.0
14
8.0
V
V 10.0
16
18
12.0
PRACTICAL
EXPERIENCES
Activity 24.1
ZAMS
ZAMS
20
14.0
22
0.0 0.4 0.8 1.2 1.6 2.0
BV
1.6
2.0
Figure 24.6.3 Colour index versus apparent magnitude for stars in the (a) M44 cluster and
(b) M92 cluster
PHYSICS FEATURE
A final thought
Checkpoint 24.6
1
2
434
Explain how the HR diagrams of star clusters support our understanding of stellar evolution.
If black holes emit no light, explain how we can observe them.
PRACTICAL EXPERIENCES
astrophysics
CHAPTER 24
This is a starting point to get you thinking about the mandatory practical
experiences outlined in the syllabus. For detailed instructions and advice, use
in2 Physics @ HSC Activity Manual.
10R
100R
Blue giants
Luminosity (L )
1000R
Supergiants (I)
104
103 1R
Red giants
Dwarfs (V)
Giants (III)
102
101
0.1R
100
101
0.01R
102
103
Red dwarfs
White dwarfs
0.001R
104
40 000
20 000
10 000
5000
2300
Temperature (K)
Figure 24.7.1
Simulation of HR diagram
435
24
Chapter summary
Birth, life
and death
The space between the stars is filled by the gas and dust
of the interstellar medium (ISM).
The gas of emission nebulae is energised by hot young
stars to emit an emission spectrum.
The dust of dark nebulae scatters starlight, reddening or
completely blocking our view.
The light of reflection nebulae is scattered by dust,
especially at blue wavelengths.
Giant molecular clouds are regions where the gas is most
dense and simple molecules can form. They are the sites
of star formation.
Gas from the ISM forms stars, is processed inside the
stars, and some is then returned to the ISM.
Giant molecular clouds can be pushed into gravitational
collapse by triggers such as supernovae.
Protostars warm up as infalling gas releases gravitational
potential energy and raises the central temperature until
nuclear fusion of hydrogen into helium begins.
Surrounding the protostar is an accretion disc and jets
of outflowing material.
Stars are born onto the zero-age main sequence (ZAMS).
The location of a star on the ZAMS, the time to get
there and its subsequent evolution are almost entirely
determined by the mass of the star.
The protonproton chain and the carbonnitrogen
oxygen cycle both combine four hydrogen nuclei into
one helium nucleus, with the release of energy and some
other light particles.
When a main sequence star runs very low on hydrogen
fuel in its core, its core shrinks and is surrounded by a
shell of hydrogen fusion, while the outer layers inflate
to create a red giant.
When the temperature of the core of a red giant is
greater than 100 million K, the fusion of helium into
Review questions
Physically
Speaking
436
Term
Definition
Planetary nebula
Supernova
Protostar
ZAMS
White dwarf
Neutron star
astrophysics
14 Compare the HR diagrams of an open cluster and
a globular cluster.
4 000
5
10
Supergiants (I)
4
10
6 000
7 000
10
3
10
2
0
Ma
2
4
2
10
in s
equ
10
1
10
2
10
White
dwarfs
(VII)
10
12
3
10
14
O5 B0
0.5
4
10
A0
F0
G0
Spectral class
K0
0.0
+0.3
+0.6
Colour index
+0.8
M0
+1.4 +2.0
Figure 24.7.2
Solving problems
16 The Sun is mainly powered by nuclear reactions of
the protonproton cycle. Each reaction contributes
4.2 1012 J to the solar luminosity of
3.84 1026 W.
a Calculate how many hydrogen atoms fuse to form
helium every second to produce this luminosity.
b If the mass of the proton is 1.67 1027 kg,
calculate how much mass this represents.
c Assuming 70% of the Suns initial mass of
1.99 1030 kg was hydrogen, calculate how
many years it will take to convert all the hydrogen
to helium.
d Contrast this result to the expected lifetime of
the Sun. Explain any discrepancy.
iew
Q uesti o
Re
Reviewing
437
Multiple choice
(1 mark each)
1 Which of the following telescopes must be used
above Earths atmosphere to observe astronomical
objects?
A Optical
B Radio
C X-ray
D Infra-red
438
Short response
6 Describe the problems associated with viewing
astronomical objects through ground-based optical
telescopes. (2 marks)
astrophysics
Extended response
12 Outline an experiment you undertook in class to
observe emission, absorption and continuous spectra.
Describe the differences you observed between the
spectra, and identify an object that produces each
type of spectra. (3 marks)
7 000
6 000
4 000
10
10
8
A
10
10
Ma
in s
equ
10
10
enc
1
1
10
10
10
12
14
10
10
O5 B0
0.5
A0
F0
G0
Spectral class
K0
M0
0.0
+0.3
+0.6
Colour index
+0.8
+1.4
+2.0
Figure 24.8.1
439
7
Context
Figure 25.0.1
440
skills
In the preliminary text you were introduced to many skills that are needed to get
through Physics. In the HSC year it is important to further develop these skills, and
to make sure you know how to use the skills without thinking twice. You will now be
able to fine tune and add to what you have learnt.
One of the most important skills for a physicist is being unambiguous, because
the mathematical nature of physics demands that everything be defined strictly and
clearly. Its also important as a scientist to have the skill of communicating clearly
the results of research to other scientists.
The HSC year is about clearly getting across what you have learnt. It is important
not to be ambiguous, so you must learn how to interpret questions and construct
your answers. Throughout this module we will focus on the skills you need to make
sure what you have learnt is conveyed clearly.
Inquiry Activity
The Bungee jumper
Lets say you are responsible for getting the bungee cord length (lo)
correct for the next jumper.
The elasticity (k) of the cord is always known. For simplicity,
assume that the energy conversion is close to 100% efficient.
You are given the chance to take any measurements of the jumper.
How could you calculate the length of the cord you need to attach
to the jumper if the bridge stands 100 m above the river below and
the jumper is to stop between 10 and 100cm from the water for
maximum thrill. Not too close for safety, but not too far for thrill.
Figure 25.0.2
A bungee jumper
Hints
A sketch of the situation (Figure 25.0.3) has been given to you to help you
along the way.
1 List the types of energy that is available at the top of the jump
1 22
EPE== kx
kx
(this will include gravitational U = mgh, elastic potential energy EPE
2
1 22
=
kx
and kinetic energy EEPE
=
mv
)
and
the
values
of
things
that
are
known.
K
2
2 Using the law of conservation of energy, write an equation that includes all
the types of energies you have listed.
3 hi is the maximum height that the cord can stretch to. It is made of the
original length of the cord plus the stretched length, i.e. hi = lo + x.
Figure 25.0.3 A sketch of the situation
4 Rearrange the equation to solve for the bungee cord length (lo).
Think about how to do this and then have a go at doing it in a scaled-down
experiment.
What will you need to know, or take measurements of? How accurate will you
need to be?
Set up your experiment to test it.
Questions
1 List any assumptions that you have made in your calculations.
2 Determine how it is possible for you to test your prediction accurately.
3 List problems involved with your experiment and how it is possible to
reduce them.
441
25
Skills
stage 2
Skills introduction
442
SKILLS
Sometimes you will need to convert from one prefix to another. The easiest way to
carry out the conversions is to follow the steps set out in the worked example below.
Worked example
Question
Convert 5.49mm to m.
Step 1: Express the number in scientific notation.
5.49 103m
Step 2: Look at the prefix that you need to convert the number to. Express the prefix in
scientific notation.
micro = 106
Number
Prefix
101
deci-
10
103
106
109
centimillimicronano-
c
m
1012
pico-
1015
femto-
10
18
atto-
10
21
zepto-
1024
yocto-
Solution
Symbol
10 m to 10 m
10
deka-
da
To change the exponent to 106, the decimal place must be moved 3 places to
the right.
102
hecto-
10
106
kilomega-
k
M
109
5.490 10
Checkpoint 25.1
1
2
3
giga-
12
10
tera-
1015
peta-
10
exa-
1021
zeta-
yotta-
18
24
10
*
*
*
*
*
*
25
Skills
stage 2
Worked example
Question
A driver of a car travelling at 16.0 m s1 slams on the brakes when he sees a ball on the
road in front of him. The ball is 25 m away at the time the brakes are applied. Will the car
stop in time, given that the deceleration is 2.5 m s2?
SOlution
Step 1: The data have been highlighted in the question.
Step 2: List them below.
u = +16.0 ms1
v=0
a = 2.5 ms2
s = +25 m
Even though our list shows that there is a displacement, this is not the actual
displacement of the car, but the initial displacement of the ball from the car.
s=?
0 = (16.0)2 + 2(2.5)s
s = 256/5
s = 51.2 m
Step 5: This is more than 25 m, therefore the car will not stop in time.
Recall from the preliminary course that the answer to a calculation needs to
be expressed correctly in order to reflect the accuracy of the measurements.
First you must look at the calculation itself. The calculation should be
completed with all the available digits in each value. When multiplication or
division is used, the answer should be expressed to the least number of
significant figures used in the values in the question. With addition and
subtraction, the answer is expressed to the lowest number of decimal places
within the question.
Remember that a calculator can invent accuracy. Just because the calculator
gives you an answer with 10 digits doesnt mean that the answer has become that
accurate. Refer back to the question and reduce your answer accordingly.
Worked example
Question
Correctly express the answer from the worked example above.
Solution
The equation has multiplication and division in it, therefore we will need to look at the
answer in terms of significant figures.
The least number of significant figures in the question is 2.
Therefore the answer to the calculation is 51m.
444
SKILLS
Checkpoint 25.2
1
2
An aeroplane has touched down on the runway at 290 kmh1. The braking ability allows it to decelerate at a rate of
11 m s2. The runway is 3.9km long. Will the aeroplane stop in time?
A satellite needs to be launched into a circular near-Earth orbit. At what height above the Earths surface would the
satellite need to be placed in order to maintain a period of 90 minutes?
Figure 25.3.1
445
25
Skills
stage 2
You may be asked in an exam to express how you made sure that your
experiment was valid and reliable, so make sure you do this when you are doing
the experiment (refer to Chapter 17 Physics skills in in2 Physics @ Preliminary).
Checkpoint 25.3
1
You are required to carry out an experiment that determines the reaction time of your lab partner. Hold a ruler just
above your partners fingers (see Figure 25.3.2). Your partner must catch the ruler after you drop it without warning.
Record how far the ruler has dropped by the time it is caught. Use your knowledge of physics to determine the
reaction time of your lab partner.
Repeat the process five times at an interval of 30s. Does your partner become better?
After a day or so, do a similar experiment, this time using a data-logger sensor. Does your partner become better?
2
3
4
Figure 25.3.2
Your partner must catch the ruler after you drop it without warning.
Checkpoint 25.4
During your study of Module 1 Space, you were required to research a scientist in relation to their contribution to space
exploration. Use this information to answer the following question: How has your scientist influenced those who have
come after him/her?
446
SKILLS
Checkpoint 25.5
For your chosen scientist, state the research carried out and explain how the research relates to the Physics course.
Worked example
Question
A student is looking at how the velocity v of a ball relates to its displacement s as it falls
through the air when dropped. To do this, the student drops a ball and records the start velocity,
the displacement it goes through and the end velocity.
What does the student need to graph in order to get a linear relationship?
Solution
The formula that we can use to relate these two variables is:
v2 = u2 + 2as
From this equation, we can see that there is not a linear relationship between v and s, but there
is a linear relationship between v2 and s. Graphing v2 against s would produce a straight line.
Checkpoint 25.6
1
2
L
.
g
What variables would need to be placed on the axes of a graph if a straight line was to result?
Determine the linear relationship between length and period from the formula T = 2
447
26
Revisiting the
BOS key terms
Verbs in action
In the Preliminary course, you were introduced to the idea of
grouping the BOS key terms in order to help you answer questions.
Below is a table with the BOS key terms in the allocated groups.
From working through the examples in the Preliminary course,
you should be very familiar with how to formalise your answers.
In the HSC year you will not have as much of an opportunity to fill
in a table and analyse your answer as completely as you have up to
now. Because you are now familiar with the structure of the answers,
we will introduce the short examination technique to help you. If you
are still not confident with structuring your answers, continue to work
as outlined in in2Physics @ Preliminary.
Group 2: Comprehension
Group 3: Application
Define
Account
Apply
Extract
Clarify
Calculate
Identify
Compare
Classify
Outline
Contrast
Demonstrate
Recall
Describe
Examine
Recount
Discuss
Distinguish
Extrapolate
Interpret
Predict
Group 4: Analysis
Group 5: Synthesis
Group 6: Evaluation
Analyse
Construct
Appreciate
Explain
Propose
Assess
Investigate
Summarise
Deduce
Synthesise
Evaluate
Justify
Recommend
448
SKILLS
Group 1: Knowledge
Group 2: Comprehension
Worked example 1
Worked example 2
Question
Question
During an experiment carried out in class, students had two currentcarrying wires, one placed near an electronic balance, the other on
it. They noticed that the readings fluctuated as they changed the
current moving through one of the wires. Discuss how these
readings changed in relation to the current.
Notes
Define falls into group 1, therefore this requires you to list
information.
Orbital velocity: tangential velocity of satellite when moving in
an orbit around the Earth
The speed of the satellite is given by the following formula:
Gm
r
This shows that the velocity is affected by the mass of the central
objectthe Earthand the radius from the centre of the object.
v=
Notes
Discuss falls into group 2, therefore this requires you to list
information and interpret details of topic.
A current-carrying wire has a magnetic field around it.
Two current-carrying wires near each other will attract or
repel, depending on the direction of the current.
Solution
26
Revisiting
the BOS
key terms
Solution
Two current-carrying wires will either attract or repel. If the current
is travelling in the same direction in both wires, the wires will
attract. If the current is in different directions, then they will repel.
The size of the force is determined by:
II
F
=k 12
l
d
As one wire is resting on the balance, the balance is registering
both the weight of the wire and the force created by the interaction
of the wires.
r3
2
86 400
r=
= 4.2 107 m
42000 km
4 2
Group 4: Analysis
Worked example 4
Notes
Worked example 3
Question
Calculate the radius of orbit for a geostationary satellite.
Notes
Calculate falls into group 3, therefore this requires you to list
information and interpret details of topic and apply knowledge.
Geostationary = 24 h, orbit = T
It is in orbit around the Earth, therefore M = 6.0 1024kg.
Equation that applies is
T
Solution
r3
T2
GM
4 2
r=?
M = 6.0 1024 kg
T = 24 60 60 = 86400s
G = 6.67 1011
450
Group 3: Application
r3
Question
GM
4 2
Solution
Both Hertz and Crookes observed properties of the cathode rays that
showed properties of both waves and particles. The main reason for
Hertz believing that the ray was a wave was from two experiments:
one showed that when an electric field was applied there was no
evidence of the ray moving, and the second showed that the rays
could pass through thin metal. This resembled the behaviour of
light, which was known to be a wave.
It was not until JJ Thomson performed the same experiment
again that the debate was put to rest. Thomson produced a nearperfect vacuum, eliminating any trace atoms within the tube, and
coated the end of the tube with a fluorescent screen. It was noted
that when the electric field was applied the ray was bent in the
direction in which a negative particle would move.
SKILLS
Group 5: Synthesis
Group 6: Evaluation
Worked example 5
Worked example 6
Question
Question
Notes:
Notes
E = hf
W = qV so measuring V will allow you to get a measure of E.
f can be varied by using coloured filters.
Set up apparatus as shown.
Measure the stopping voltage for each frequency.
Produce a graph of V against f.
Convert V to E by multiplying by 1.6 1019.
Determine the gradient of the graph. This is equal to h.
Solution
Set up the equipment as shown. (Draw a diagram like
Figure 9.3.1.)
Measure the stopping voltage for each frequency of light.
(Stopping voltage is the voltage to stop current flowing in
the circuit.)
Produce a graph of E against f. To determine E multiply V
by 1.6 1019. (Draw a graph like that shown below.)
The gradient of the graph is h.
EK
EK = hf
slope = h
threshold
frequency
Frequency
Solution
The Space Shuttle was the first reusable space transport. It was
designed to fly many missions and be reused quickly while
protecting those inside.
On take off, the g-forces that are applied to the astronauts are
massive. As they need to be able to withstand the g-force, the
astronauts lie down during take off, so the blood does not rush
away from their vital organs and they remain conscious throughout
the launch.
Most of the precautions are taken for the re-entry. On the
approach the astronauts must aim the shuttle to come in at an angle
of between 1 and 2, in order to avoid burning up in the atmosphere
or skipping off the atmosphere. The burning is due to friction. A small
amount of this has to occur, and so heat shields are used to minimise
the effects on the astronauts. The underside of the surface of the
Shuttle is covered with tiles that can be superheated and dissipate
the energy quickly. This avoids extensive heat inside and converts the
massive amounts of KE to heat quickly, slowing the Shuttle.
The nose cone of the Shuttle is fairly blunt. This also aids in the
conversion of KE to heat and therefore helps to slow the shuttle.
A pointed nose would be more likely to melt.
There is a limit to how much energy can be converted to heat
safely, so the final means of slowing down is the use of parachutes.
These parachutes are deployed on landing and allow the Shuttle to
slow and stop safely.
The main purpose of the Shuttle is to transport humans safely.
These precautions must be taken in order for the occupants
to survive.
451
Numerical answers
Numerical answers
These are selected numerical answers only. A complete set of
answers can be found in the Teacher Resource.
Module 1 Space
Checkpoints: 1.1 1 ah = 0, av = 9.8ms2 down 2 =45,
=90 3 drag 4 both at the same time. 6 see Table 1.1.1
1.2 1 F = Gm1m2)/d2 3 1/d2
1.3 2 EP = (Gm1m2)/r 3 0
Review questions: 3 2.5s 5 a 2Fi b 4Fi c Fi/4 8 11.2kms1
10 a 1.38s b 4.37m above ground c 18.0m (right)
d 14.3ms1, 25 below horizon 11 a 25.3ms1, 0.883s
b 17.9ms1, 1.25s 13 5.35ms1 14 260.5N, 89.99
from x-axis 15 7.0 m
17 5.30108J 18 10300ms1 19 a 42100ms1
Chapter 2
Checkpoints: 2.1 1 Sergey Korolyov, Wernher von Braun
6 a g-force = 1 b g-force = 0 c g-force = 0
2.2 3 ac = v2/R 8 True 9 a = r 10 Mercury
2.3 1 circle, ellipse, hyperbola and parabola 2 hyperbola
2.4 2 hyperbolic 3 they are equal 4 vmax = 2VP
2.5 2 false
Review questions: 15 6.25107 J, 2.121012m
16 a 10690kgs1 b 5.2ms2, 12.1ms2 17 a w = mg
b N = 0 c ac = g d v > 7.00ms1 e 0 19 32.7 h1
20 16.8 days 22 a 4.22107 m b 2.66107 m
23
KE
Geostationary 9.47109 J
Chapter 7
Chapter 1
Type of
orbit
Chapter 6
GmM
2a
Two-body PE
ME =
KE + PE
1.8871010 J
9.40109 J 9.44109 J
Coils in
primary
Primary
voltage (V)
Coils in
secondary
Secondary
voltage (V)
100
320
50000
25
6
240
393
200
66.7
30500
12
50
240
Current in
primary
coil (A)
Voltage in
primary
coil (V)
Current in
secondary
coil (A)
Voltage in
secondary
coil (V)
5
0.1
0.5
6
240
200
0.125
2
0.1
240
12
1000
2 9.6 1019N
Chapter 3
Chapter 9
Module 1 review
Multiple choice: 1 B 2 D 3 A 4 A 5 C
Chapter 10
Review questions: 14 a 3.06 1019 J b 3.27 1015
15 1.09 106 m 16 5.94 1020 J = 0.37 eV
Chapter 11
Module 3 review
Chapter 4
Checkpoints: 4.3 1 at 90 to each other
2 F B, F I, F l, F sin
Review questions: 13 4.3 17 3N 18 b gradient = B/F = 2.47
c 1/gradient = 0.4, Il = 0.4, 100%
Chapter 5
There are no numerical answers for this chapter.
452
Turns
ratio
Multiple choice: 1 A 2 D 3 A 4 C 5 B
1.491010 J 1.501010 J
Step-up
Step-down
Step-down
Module 2 review
Step-up or
step-down
0.025
20
0.2
Numerical answers
Chapter 13
Checkpoints: 13.2 2 2.4 109 m 3 6.56 1025 kg m s1
13.5 2 1.93 1010 m
Review questions: 12 b 2.4 1011 m 13 a 7.3 104 m s1
14 a i 1.2 109 m ii 6.6 1013 m 15 2.87 105 m s1
16 a i 4 1036 m ii 3 1034 m iii 3 1038 m
17 a + infinity b no 18 a 1.1 m
Chapter 14
Checkpoints: 14.2 1 a factor of ~ 1035 greater
14.6 2 5.2 1027 kg 3 280 MeV
Review questions: 14 a Fe = 231 N b Fg = 5.6 1034 N
15 0.034348 amu 16 a 28.4MeV b 0.03049amu
c 4.0026 amu 17 c 8.9 MeV per nucleon
18 a 102
(yttrium) b Uranium 7.6 MeV/nucleon,
39 Y
iodine 8.5 MeV/nucleon, yttrium 8.6 MeV/nucleon
1
0n
65
66
23
+ 29
19Cu
a a=29
b
Na +
11Cu
22
10 Ne
0
20 +1
17.406
e + v MeV
Chapter 15
Chapter 19
Review questions:
17 a U-238 Th-234:
Th-234 Pa-234:
Pa-234 U-234:
U-234 Th-230:
Th-230 Ra-226:
Ra-226 Rn-222:
Rn-222 Po-218:
Po-218 Pb-214: , Po-218 At-218:
At-218 Bi-214:
Pb-214 Bi-214:
Bi-214 Tl-210: , Bi-214 Po-214:
TI-210 Pb-210:
Po-214 Pb-210:
Pb-210 Bi-210:
Bi-210 Po-210:
Po-210 Pb-206:
18 a 3 years
Chapter 20
Review questions: 18 6 19 1.41 1026 J 20 2.8 1010 Hz
Module 5 review
Multiple choice: 1 C 2 B 3 B 4 B 5 B
Module 6 Astrophysics
Chapter 21
Checkpoints: 21.3 2 40 21.5 2 0.1 arc second
Review questions: 13 14 ~0.2 ly 17 ~90 ly
Chapter 22
Multiple choice: 1 C 2 C 3 B 4 D 5 A
Chapter 23
Module 4 review
Chapter 17
There are no numerical answers for this chapter.
Chapter 18
Review questions: 9 26.7 10 a 43.98 b 67.5
11 n2 = 1.33, water
Chapter 24
Review questions: 16 a 3.66 1038 b 6.11 1011 kg
c 72 billion years d ~6 times the expected lifetime of the Sun
Module 6 review
Multiple choice: 1 C 2 D 3 B 4 D 5 C
Short response: 8 a 1.4 b star B
Module 7 Skills
Chapter 25
Checkpoints: 25.1 1 a 450 mg (milligrams) b 0.345 kg
(kilograms) c 5.6 Mg (megagrams) 2 0.0045 Mm
3 0.00000000000054 km
25.2 2 300km above surface
T 2
25.6 1
g = L 2 s and t2
2
Chapter 26
453
Numerical answers
Glossary
Glossary
Glossary
core in a clad optical fibre, the higher index central fibre that carries
the light
core in a nuclear fission reactor the core houses the fuel rods,
control rods, a coolant system and a moderator material; it is
where fission occurs
Glossary
donor energy level an extra energy level created in the energy gap
just below the conduction band of a semiconductor as a result of
the introduction of donor impurities
457
Glossary
flux leakage loss of magnetic flux as it passes from the primary coil
through the secondary coil, resulting in loss of induction in the
secondary coil and, hence, loss of energy
gimbal a pivoted frame that allows rotation around more than one
axis; common example is the mount that supports a gyroscope
focal length the distance between a lens or mirror and the image it
forms of a distant object
Glossary
Keplers laws the three laws that govern the motion of planets
around the Sun
Glossary
Manhattan Project the code name for the US secret atomic bomb
project during World War II
460
Glossary
prism a wedge-shaped glass block that disperses visible light into its
component colours using refraction within the glass
461
Glossary
RF transceiver coils the coils that both transmit the RF pulse and
receive the RF echo of relaxing nuclei in MRI
Glossary
Glossary
visual binary a binary star system that can be seen as two stars
by a telescope under sufficiently good seeing conditions
464
Index
Index
A-mode ultrasound scans 312
ablating materials 49
absolute magnitude, of stars 399
absorption spectra, lines in 232, 393
AC electric motors 1214
synchronous 1301
AC generators
compared with DC 135
simple 1313
AC generators and transformers, affect on
society 1467
AC induction motors
activity 126
single-phase 124
three-phase 1223
AC power
generation and delivery 1424
losses during transmission and
distribution 1446
accelerated particle beams, uses of 288
acceleration due to gravity, activities 201
acceptor energy level 193
accuracy in experiments 445
acoustic coupling 312
acoustic impedance (Z) 31012
active optics in telescope mirrors 3778
adaptive optics in telescopes 37980
aether drag 64
aether model for light transmission 612
agricultural radioisotopes 284
air resistance (drag) 7
Airy disc 376
Algol ( Persei) eclipsing binary star system
412
alpha decay 264, 341
alpha () particles 341
alternating current (AC) 85
electric motors 1214
amplification of currents using triodes 198
Anderson, Carl, and cosmic rays 286
angle (), in motor effect 93
ngstrm, Anders, and hydrogen spectrum
233
angular resolution of telescopes 3767
anode glow 158
ANSTO (Australian Nuclear Science and
Technology Organisation)
Bragg Institute at 272
Echidna neutron diffractometer 285
National Medical Cyclotron at Royal
Prince Alfred Hospital 282
OPAL reactor facility at Lucas Heights
272, 282, 345
radiopharmaceutical production at 353
antennae, radio 1756
apparent magnitude of stars 398
apparent weight 312
artefact standards of mass and length 79
Aston dark space in discharge tubes 158
astrometric binary stars 411
astrometry 3889
astronauts, forces on during take-off 315
astronomical unit (AU) of distance 389
asymptotic giant branch 428
ATLAS particle detector in Sydney 2901
atom
Bohrs postulates for model of 2356
Rutherfords model 22830
atomic bombs 279
atomic mass, and the neutron 261
atomic mass number (A) 262
atomic mass unit (amu) 266
atomic number (Z) 262
atomic piles 270
atomic spectra
for hydrogen 2325
for larger atoms 239
atoms, historical understanding 229
Australia Telescope Compact Array (ATCA),
for interferometry 381
Avogadro project (CSIRO) 79
B-mode ultrasound scans 313
back emf in DC electric motors 120
ballistic trajectories 3
Balmer, Johann
Bohrs explanation of series 2368
emission spectrum series for hydrogen
233
band gap (forbidden energy gap) 189
barium emission spectrum 239
barium meal X-ray procedure 325
baryons 292
BCS (Bardeen, Cooper and Schrieffer) theory
215, 217
Bessel, Friedrich, and stellar parallax motion
388
beta () particles 341
beta decay 260, 3412
positron production 342
beta (minus) and (plus) decay 264
Centauri A and B orbits 410
bias, forward and reverse 1945
binary stars 40710
Centauri 410
types of 41112
binding energy of nucleus 2689
bipolar transistors 1989
black body absorbers and emitters 178
black body radiation 1789
black holes 431, 432
blood flow and Doppler effect 31516
Bohr, Nils
explanation of Balmer series for hydrogen
2368
postulates for atomic model 2356
and Rutherfords atomic model 230
bone-density measurement using ultrasound
315
bone scans with gamma camera 3467
465
Index
466
Index
gravity 1015
activity 20
effect on orbits 3540
gravity assist (slingshot effect) 445
hadrons, properties 294
Hahn, Otto
aand radioactive isotopes 269
and beta decay 265
half-life of radioisotopes 282, 343
Halley, Edmund, comet of 39
Harriot, Thomas, and telescope 368
heart imaging by echocardiography 316
heat shields 49
heating during space shuttle re-entry 4850
Heaviside, Oliver, and Maxwells equations
61
Heisenberg, Werner, uncertainty principle
of 2545
helium flash 428
Herschel, William, and reflector telescope
373
Hertz, Heinrich
and cathode rays 157
discovers photoelectric effect 182
measures speed of radio waves 61, 1757
verifies electromagnetic wave theory
1745
HertzsprungRussell (HR) diagram 3956
activity on star clusters 435
and distance of Crucis 402
life of star on main sequence 427
showing evolutionary tracks for protostars
424
showing variable stars 414
for star clusters 434
Higgs particle 291
high-mass stars, evolution of 4301
high-temperature superconductors 216
high-voltage DC (HVDC) transmission 146
Hipparchus, and star magnitudes 397
Hohmann transfer orbits 57
holes in valence bands 191
hollow structures, X-ray images of 325
homopolar electric motor 83
Hooke, Robert, law of 15
Hounsfield, Godfrey, and tomography 321,
327
hydrogen atom
Balmer series for 233
emission spectrum 239
spectra and energy levels of 2325
hydrogen bomb 280
hydrogen proton
as energy carrier 285
precession of 356
spin in magnetic field 3556
subjected to RF pulses 3579
hyperbolic orbits 412
hypersonic flight 489
impedance matching 312
induction cooking 108
see also electromagnetic induction
induction motors
single-phase AC 124
see also electromagnetic induction
industrial radioisotopes 283
inertial frames of reference 589
inertial and non-inertial frames of reference
activity 75
infra-red light 197
insulators
energy band diagrams for 18990
in high-voltage AC power transmission
1456
integrated circuits (ICs) 200
interference, constructive and destructive 205
interferometry
in MichelsonMorley experiment 624
in radio telescopes 3801
International Thermonuclear Experimental
Reactor (ITER), superconductors in 218
interstellar medium (ISM) 4223
intrinsic variable stars 414
inverse square law 11
isotopes
of atoms 2623, 341
radioactive decay of 3403
James Webb space telescope (JWST) 382
Jansky, Carl, and background radiation 373
Joliot-Curie, Irene, and radioactive isotopes
269
junction transistors 1989
Kamerlingh Onnes, Heike, liquefies helium
211
Kelvin, Lord (William Thomson), and
temperature scale 210
Kendall, Henry, and quarks 292
Kepler, Johannes
laws of planetary motion 359
telescope 3712
third law 409
Kirchhoff, Gustav, and spectral lines 390
Korolyov, Sergey (aka Sergei Korolev), USSR
space program leader 28
Kronig, Ralph, and electron spin 240
Lagrange points 29
landing of spacecraft 501
Large Hadron Collider (LHC) at CERN
288, 2901
Larmor frequency 356
laser diodes 195
Laue, Max von, X-ray diffraction patterns
2079
launch angle of projectile 6
law of universal gravitation 11
in finding new planets 40
in predicting small deviations in planetary
orbits 39
Lawrence, Ernest, and linear accelerator 287
length contraction 6970
length (l ) in motor effect 93
Lenz, Heinrich
application to DC motors 121
law of 1045
467
Index
leptons 2924
levitated magnets 21314
Lewis, Gilbert, names photons 231
light
aether theory of transmission 612
infra-red 197
intensity and wavelength 178
speed of 65
as wave and particles 179
wave properties of 2067
light-emitting diodes (LEDs) 195
light-year (ly) distance unit 389
lighting, fluorescent and neon 173
lightning protection 147
linear array transducers 309
linear electric motors 129
linear generator principle 133
linear particle accelerators 2867
linearising formulae 447
Livingston, Stanley, and linear accelerator
287
longitudinal relaxation time constant (T1)
358
Lorentz, Hendrik
and aether 63
factor () 67
loudspeaker-making activity 126
loudspeakers 91, 93
low Earth orbits (LEOs) 43
luminiferous aether 61
luminosity classes 3956
Lyman, Theodore, spectral series for
hydrogen atom 2334
M-mode ultrasound scans 313
Mach, Ernst (18381916), and ratio of speed
of sound 48
magnetic field strength (B)
as magnetic flux density 101
in motor effect 92
magnetic fields
and electric currents 87
forces on charged particles in 89
magnetic flux (B) 1013
magnetic hysteresis losses
in induction cookware 108
in transformers 140
magnetic levitation (maglev) 21314
trains 129, 21819
magnetic moment 355
magnetic poles 88
magnetic resonance imaging (MRI) 21718
applications 362
behaviour of hydrogen proton in 357
see also MRI scanners
magnetism, and spin of atomic particles
3545
magnification of telescopes 371, 3745
magnitudes of stars 397400
main sequence stars, properties relative to
Sun 425
mammograms 325
Manhattan Project 27980
Mariner 10, use of gravity assist by 45
Marsden, Ernest, and model of atom 22930
468
mass
equivalence to energy 724
relativistic 71
mass defect 267
massenergy 73
massluminosity relationship 413
massive stars, fate of 4301
matter, crystal structure of 2045
matter waves, de Broglies equation 24850
Maxwell, James Clerk
electromagnetic wave theory 1745
equations of 612
medical radioisotopes 283
medium Earth orbits (MEOs) 43
Meissner effect 212
Meitner, Lise
and beta decay 265
and radioactive isotopes 269
mesons 292
metal-oxide-semiconductor field-effect
transistor (MOSFET) 200
metals, crystal structure and electrical
conductivity 20910
methamphetamines, brain tissue loss in users
302
metric prefixes 4423
Michell, John, weighing the Earth 25
Michelson, Albert, and speed of light 62
MichelsonMorley experiment 624
results interpretation activity 756
microscopes, optical to electron 259
Minkowski, Hermann, and spacetime 72
moderators in nuclear reactor 281
molybdenum-99 production by ANSTO 353
momentum conservation, in slingshot effect
44
Moores law 223
Morley, Edward, and speed of light 62
Moseley, Henry 247
and model of atom 230
motion, components of 5
motor effect (F) 903
activity 97
MRI scanners
operation of 3601
see also magnetic resonance imaging
(MRI)
muons 286
n-type semiconductors 192
National Medical Cyclotron at Royal Prince
Alfred Hospital, Sydney, radioisotope
production at 282, 345, 353
nebulae, types 422
negative glow 158
neon lights 138, 173
neutrinos, Paulis proposal 2656
neutron, discovery of 2601
neutron diffractometers, ANSTOs Echidna
285
neutron scattering, applications of 272, 278
neutron stars 431, 432
neutrons, number (N) in atomic nucleus
2623
Index
quarks 2924
quasars, emission spectra for 394
racing magnets 107
radiation shielding, in nuclear reactors 281
radio antennae 1757
radio frequency (RF) waves 356
radio telescopes 3723
radio transmitters and receivers 1745
radio waves, speed of 1757
radioactive decay of isotopes 3403
radioactive tracers 282
radiographs, conventional 324
radioisotopes
decay of 3403
half-life of 343
production of 345
range of uses 2824
radiopharmaceuticals 283
choice of 3445
radiotherapy 283
random error 445
Reber, Grote, and radio telescope 373
rectifiers 195
re-entry into Earths atmosphere 4650
safe corridors 478
reflecting telescopes 3723
reflection nebulae 422
refracting telescopes 3712
regenerative braking 134
relativistic mass and speed 71
relativity, Einsteins principle of 589
relaxation of hydrogen protons after cessation
of RF pulse 3589
relaxation time constants (T1 and T2) of
body tissues 35960
research, presenting 4467
resistance, electrical
from crystal structure 210
losses in transmission lines due to 1445
and Ohms law 86
and power 86
resistive heating (Q)
in induction cooktops 108
in transformers 140
resonance 358
rest mass (proper mass) 71
reverse bias in pn junctions 195
RF transceiver coils in MRI scanners 361
right-hand grip rule 878
right-hand palm (or push) rule 89, 901,
123
rocket engines and stages 301
rocketry
history of 267, 29
researchers in 20th century 279
rockets
forces during take-off 334
thrust on 30
roller-coasters, eddy current braking in
11213
Rntgen, Wilhelm Conrad, discovers X-rays
207
RR Lyrae variable stars 41516
on HR diagram 414
469
Index
470
magnification 374
methods of sharpening images 37780
radio 3723
sensitivity 3756
television
CRT tubes used in 1689
LCD, plasma and SED-TVs 173
Tesla, Nicola, and AC motors and generators
141
thermionic devices, compared with solidstate devices 199
thermionic diodes and triodes (valves) 198
thermonuclear bombs 280
Thomson, George P, and electron beam
diffraction 252
Thomson, Joseph John 160
discovery of electron 1656
Thomson, William (Lord Kelvin), and
temperature scale 210
three-phase AC induction motors 1223
three-phase alternating current 1212
3D real-time ultrasound images 314
thrust on rocket (FT) 30
time dilation 667
time of flight of projectiles 6
torch without batteries 133
torque () on a rotating coil 11517
total internal reflection, in optical fibres 334
trains, eddy current braking in 11213
trajectories 3
components 57
transducers
piezoelectric 3089
for ultrasound 305
transformers
efficiency and design 13740
in electricity distribution 1445
in the home 144
principles 1367
transistors 1989
transmission towers for high-voltage AC
power 1456
transmutation, artificial and natural 2635
transverse magnetisation (Bxy) 359
transverse relaxation time constant (T2) 359
triodes as current amplifiers 198
triple alpha process 428
Tsiolovsky, Konstantin, rocket equation 27
twin paradox 68
2D real-time ultrasound scans 31314
2D slice image construction from CAT
scanning 3278
type-I and type-II superconductors 21213
ultrasound 3045
reflection at tissue boundaries 31012
ultrasound imaging
and acoustic impedance 31011
limitations 3078
in obstetrics 307
principles 3057
types of scans 31216
using piezoelectric transducers 3089
ultrasound scans, types 31214
Acknowledgements
We would like to thank the following for permission to reproduce photographs, texts and illustrations. The following
abbreviations are used in this list: t = top, b = bottom, c = centre, l = left, r = right.
Images.com: p. 2.
CartoonStock: p. 299.
Intel: p. 223.
iStockphoto: pp. 138, 139, 142, 178, 217, 307, 326, 338,
348br, 350tl, 380b, 441.
NASA: pp. 17, 26, 28t, 28b, 33, 36, 49a,b,d, 50, 60, 72,
382, 421, 423, 429, 431r.
D. Smyth: p. 381l.
NRAO/AUI/NSF: p. 373.
Fourmilab: p. 11.
Fundamental Photographs, NYC: p. 251l.
PA/Jeff Stanger: pp. 83, 90, 115, 118, 124, 132, 133, 144r
/John OByrne: p. 381r.
Photolibrary Pty Ltd: pp. 3, 12, 13, 15, 27, 28c, 91, 114,
119, 122, 127, 141, 146, 154b, 155, 167, 176, 180, 185,
197t, 204, 207, 210, 213, 227, 228, 235, 246, 249, 251c,
251r, 252, 254, 255, 260, 271, 306, 308, 309, 314t, 314c,
315, 319, 324r, 325tr, 328, 332r, 333, 337, 348tl, 349,
352tr, 362, 367, 390b, 392, 422, 433, 440.
471
Acknowledgements
472
FORMULAE SHEET
v =f
F = qvB sin
m1m2
r
Ep = G
F = mg
d2
v1
sin i
=
v2
sin r
vx 2 = ux 2
E =
F
q
v = u + at
R=
V
I
vy 2 = uy 2 + 2ay y
T2
aav =
vu
v
therefore aav =
t
t
F =
F = ma
F =
Ek =
mv 2
r
1
mv 2
2
1
a t2
2 y
4 2
Gm1m2
d2
E = mc 2
v2
lv = l0 1
c2
t0
tv =
v2
v2
c2
p = mv
Impulse = Ft
F
l
= k
I1I 2
d
F = BIl sin
1
d =
p
M = m 5log
IA
IB
= Fd
= nBIA cos
Vp
Vs
np
ns
d
10
( mB mA )
= 100
m1 + m2 =
Rf
Ri
[ Z2 Z1] 2
[ Z2 + Z1] 2
Charge on electron, qe
1.602 1019 C
Mass of electron, me
9.109 1031 kg
Mass of neutron, mn
1.675 1027 kg
Mass of proton, mp
1.673 1027 kg
340 m s1
9.8 m s2
Speed of light, c
3.00 108 m s1
c2
m0
mv =
DATA SHEET
GM
1
W = Fs
Ir
I0
y = uy t +
=
Vin
Z = v
Energy = VIt
r3
Vout
Vin
c = f
x = ux t
r
t
V
d
Vout
E = hf
P = VI
vav =
A0 =
4 2r 3
0
2
2.0 107 N A2
Mass of Earth
6.0 1024 kg
Planck constant, h
6.626 1034 J s
1.097 107 m1
1.661 1027 kg
931.5 MeV/c2
1 eV
1.602 1019 J
Density of water,
1.00 103 kg m3
GT 2
1
1
1
= R 2 2
n f ni
h
mv
473
474
20
Ca
40.08
38
Sr
87.62
56
Ba
137.3
88
Ra
[226]
19
K
39.10
37
Rb
85.47
55
Cs
132.9
87
Fr
[223]
Francium
Caesium
Rubidium
Potassium
Sodium
Radium
Barium
Strontium
Calcium
Magnesium
12
Mg
24.31
Beryllium
11
Na
22.99
Lithium
4
Be
9.012
3
Li
6.941
Hydrogen
1
H
1.008
Thorium
Protactinium
Uranium
92
U
238.0
Neptunium
93
Np
[237]
Promethium
61
Pm
[145]
Plutonium
94
Pu
[244]
Samarium
62
Sm
150.4
Hassium
108
Hs
[277]
107
Bh
[264]
Bohrium
76
Os
190.2
Osmium
Ruthenium
44
Ru
101.1
Iron
26
Fe
55.85
75
Re
186.2
Rhenium
Technetium
43
Tc
[97.91]
Manganese
25
Mn
54.94
Americium
95
Am
[243]
Europium
63
Eu
152.0
Curium
96
Cm
[247]
Gadolinium
64
Gd
157.3
110
Ds
[271]
Platinum
78
Pt
195.1
Palladium
46
Pd
106.4
Nickel
28
Ni
58.69
Copper
Gold
Berkelium
97
Bk
[247]
Terbium
65
Tb
158.9
111
Rg
[272]
79
Au
197.0
Silver
47
Ag
107.9
29
Cu
63.55
Name of element
Symbol of element
109
Mt
[268]
Iridium
77
Ir
192.2
Rhodium
45
Rh
102.9
Cobalt
27
Co
58.93
Gold
79
Au
197.0
KEY
Californium
98
Cf
[251]
Dysprosium
66
Dy
162.5
Mercury
80
Hg
200.6
Cadmium
48
Cd
112.4
Zinc
30
Zn
65.41
Einsteinium
99
Es
[252]
Holmium
67
Ho
164.9
Thallium
81
Tl
204.4
Indium
49
In
114.8
Gallium
31
Ga
69.72
Aluminium
13
Al
26.98
Boron
5
B
10.81
Fermium
100
Fm
[257]
Erbium
68
Er
167.3
Lead
82
Pb
207.2
Tin
50
Sn
118.7
Germanium
32
Ge
72.64
Silicon
14
Si
28.09
Carbon
6
C
12.01
Mendelevium
101
Md
[258]
Thulium
69
Tm
168.9
Bismuth
83
Bi
209.0
Antimony
51
Sb
121.8
Arsenic
33
As
74.92
Phosphorus
15
P
30.97
Nitrogen
7
N
14.01
Nobelium
102
No
[259]
Ytterbium
70
Yb
173.0
Polonium
Lawrencium
103
Lr
[262]
Lutetium
71
Lu
175.0
Astatine
Iodine
53
I
126.9
Bromine
35
Br
79.90
Chlorine
17
Cl
35.45
Fluorine
9
F
19.00
Radon
Xenon
54
Xe
131.3
Krypton
36
Kr
83.80
Argon
18
Ar
39.95
Neon
10
Ne
20.18
Helium
2
He
4.003
84
85
86
At
Rn
Po
[209.0] [210.0] [222.0]
Tellurium
52
Te
127.6
Selenium
34
Se
78.96
Sulfur
16
S
32.07
Oxygen
8
O
16.00
For elements that have no stable or long-lived nuclides, the mass number of the nuclide with the longest confirmed half-life is listed between square brackets.
The International Union of Pure and Applied Chemistry Periodic Table of the Elements (October 2005 version) is the principal source of data. Some data may have been modified.
Actinium
91
Pa
231.0
90
Th
232.0
Actinoids
89
Ac
[227]
60
Nd
144.2
Seaborgium
106
Sg
[266]
Tungsten
74
W
183.8
Molybdenum
42
Mo
95.94
Chromium
24
Cr
52.00
Praseodymium Neodymium
Cerium
Lanthanum
Dubnium
59
Pr
140.9
Rutherfordium
Actinoids
105
Db
[262]
Tantalum
73
Ta
180.9
Niobium
41
Nb
92.91
Vanadium
23
V
50.94
Lanthanoids
57
58
La
Ce
138.9
140.1
104
Rf
[261]
Hafnium
72
Hf
178.5
Zirconium
40
Zr
91.22
Titanium
22
Ti
47.87
89103
Lanthanoids
5771
Yttrium
39
Y
88.91
Scandium
21
Sc
44.96
Atomic Weight
Atomic Number