Академический Документы
Профессиональный Документы
Культура Документы
.0
~~"-1 ~
meters and measuring devices should be read by interpolating between the smallest
marked scale division. Generally the most sensible interpolation is to attempt to estimate ten divisions between the smallest marked scale division. Consider the section
... /"".
W
"-.
of a meter stick pictured in Figure 1 which shows the region between 2 cm and 5 cm. The smallest marked scale divisions are 1 mm apart. The location of the arrow in the
\
Q
figure is to be determined.
It is clearly between 3.4 cm and 3.5 cm. and the correct In this case a reading of 3.43 cm is estimated. Much of the data taken in data may contain Thus if data
~
. :::s
For this measurement the first two digits are certain, but the last digit is estimated.
This measurement is said to contain three significant figures. this laboratory will have three significant four or even five significant is to properly contain estimated. place. figures. figures. three must be known, and the fourth figures, but occasionally
:'i ~ ""~ ~
K
't
"""'
Note that there should only be one estimated place in any measurement.
~
~.
C\ ~ Q
four significant
Never attempt to make the estimate between marked scales to more than one figures when only three are justified.
For example. never attempt to locate the position of the arrow in Figure 1 as
~
\f\
"""- ~
J\
~ ~
%
5. Mistakes or Personal Errors All measurements are subject to errors. There are three types of errors which are
classified as personal. systematic, or random.
statistical errors. errors will be discussed the experimenter. later.
This section deals with personal errors while systematic and random In fact. personal they are called Instead. they are merely mistakes made by can be
same sense as the other two types of errors. Sometimes fundamentally different
~
.
..
in the process of taking the data or later in calculations done with the original data.
Either type of mistake is bad. but a mistake made in the data-taking process is probably
\ ~
\-L
worse because often it is not discovered until it is too late to correct it. Typical mistakes made in the data-taking process include misreading of scales and recording data incorrectly. have ten small divisions division mark one-half One common mistake in reading scales involves scales that between major divisions, and they also have an intermediate way between the major divisions. Such a scale is shown in
Figure 2. Note that the correct position of the arrow is 3.75 cm. The common mistake occurs when the experimenter position as 3.25 cm. is confused by the middle mark and incorrectly reads the of the type of It is very important to understand that there This example is only intended to be representative
~
!r1:!~r~~
-~--
~#!!i"';
~c
3
Figure
4
2.
are many different ways for scales to be marked that might lead to possible confusion in reading the scales. in advance. Instead of expecting to have all the potential sources of mistakes pointed out. either by the laboratory all the procedures instructions or by the instructor, students must develop a specific to attitude toward all data-taking processes. The correct attitude is one of skepticism about Essentially. this amounts However, it is simply not feasible to point out all the specific pitfalls
assuming that things are going to go wrong unless constant attention is given to making sure that no mistakes are made. own understanding many students do exactly the opposite. of their taken all aspects of the They assume that whatever they are doing is correct without being self-critical of the procedure. For every measurement process must be checked and rechecked.
that he knows exactly what is supposed to be measured, what the correct procedure is to measure it. and that no mistakes are being made by the group in carrying out that
"'\
procedure. Exactly the same self-critical approach must be taken to avoid mistakes in calculation of other quantities from the measured quantities. There are at least two kinds of mistakes that can be made in these calculations. The first kind of mistake is to incorrectly state the equation for some derived quantity in terms of the measured qu~ntities. For example. suppose that the distance an object travels and the time it takes that object to travel that distance. are both measured. The task is to calculate the average velocity of the object. The velocity can only be calculated correctly by dividing the distance by the time. Any other calculation using that data will not lead to velocity. Typical mistakes might be to divide the time by the distance or perhaps multiply the distance by the time. Mistakes of this kind are difficult to detect when the first calculated quantity. in this case velocity. is then used to calculate another quantity. For example. if an incorrectly calculated value for velocity is then used to calculate the acceleration, the results obtained for the value of the acceleration will obviously be incorrect. The situation is even more complicated by the fact that often when this kind of mistake is made. a whole series of repeated calculations is performed incorrectly. The second kind of common mistake occurs when incorrect arithmetic operations are performed on a correct equation. Considering the fact that most students do all their ~
--
-I
"I.
,~
calculations with hand-held electronic calculators. this only amounts to pushing the wrong button,but it is a very common mistake. Again the best defense against such a mistake is to cultivate the proper habit of continually checking and rechecking all calculations. If all partners do their own independentcalculations.mistakes will ohen be found that would not be discovered if one person's calculations were unquestionably acceptedby the group. A final point to be made about calculationsis, to the extent possible, all calculations should be completedbefore leaving the laboratory. Not only are calculationalmistakes more likely to be detected and corrected,but sometimesattemptsto perform calculations with the data that has just been taken make it clear that mistakeswere made in the datataking process. If time permits the original mistake made in taking the data can be corrected by performingthe experiment again.
~~I;~c"";
~ii'<;"" ::~c,'
~~i;;'(;I_<"
.
(\,
,.;},.!'/
between excellent results and complete chaos is the care with which simple
Sometimeseven if the measurementsare performedin a basically correct manner, the point to the exercise may not be readily apparentbecauseof small additionalscatter in the data causedby carelessness. For example,if a seriesof measurements the one like illustrated in Figure 1 is made, but only two significantfigures are recorded, the effect being measuredmay change so slightly that the smooth nature of the changes in the measured quantitymay not be apparent. The studentwho strives successfullyto develop careful experimentaltechniques will be rewardedboth by success in the laboratory as well as by preparationfor future advance laboratorywork.
.t:; " ~
.value. The terms, accuracyand precision, are used to describe differentaspects of the
difference between the measured value and the true value of some quantity. Note that although most dictionaries make no distinction between them, from a scientific point of view, the terms accuracy and precision have very different meanings. The accuracy of a measurement is determined by how close the result of the measurement is to the true value. For example, in several of the experiments a value for the acceleration due to gravity (g=9.80 m/sec2) will be determined. For this case the accuracy of the result is decided by how close it is to the true value of 9.80. For several of the laboratory experiments the true value of the measured quantity is not known, and the accuracy of the experiment cannot be determined from the available data. It is worth noting that even for current research-level experimental work in the physical sciences the determination of the accuracy of a result is extremely difficult in most cases. The precision of a measurement refers essentially to how many significant digits there are in the result. It is an indication also of how reproducible the results are when measurements of some quantity are repeated. When repeated measurements of some quantity are made, the mean of those measurements is considered to be the best estimate of the true value. The smaller the variation of the individual measurements from the mean, the more precise the quoted value of the mean is considered to be. This idea about the relationship between the size of the variations from the mean and the precision of the measurement shall be elaborated upon and quantified in a later section on statistical methods. As an illustration of these ideas about accuracy and precision, consider the following sets of hypothetical measurements of the acceleration due to gravity made by four students named Alf, Beth, Carl, and Dee. Each student made three independent measurements and then took the mean of those three measurements as his or her value for g. Also shown in the table is the magnitude of the deviation of each individual measurement from that particular student's mean. The results are: Alf dev 7.83 1.60 11.61 2.18 8.85 0.58 9.43 Beth dev 9.53 0.27 9.38 0.12 8.87 0.39 9.26 Table 1. Carl dev 8.70 0.04 8.75 0.01 8.77 0.03 8.74 Dee dev 9.72 0.04 9.86 0.10 9.70 0.06 9.76 ~
~c'
.-.
Measurement 1.
Measurement 2.
Measurement 3. Mean
In order to be sure of the interpretation of the data table, note that the first set of entries in the table indicates that Airs three measurements were 7.83, 11.61, and 8.85, and his quoted value for the measurement is 9.43 which is the mean of those three values. Furthermore, the magnitudes of the deviations of each of his individual 6 ~" .-
..
..
'
,,'"
:'*",~W'!i""'f;"""'~_: .
:"',iii~';(c,;c, ,
-I.
measurements ~
should have units of m/sec2, but the units are omitted here for convenience. The accuracy of each student's data is determined by comparing the mean with the true value of 9.80. Doing so indicates that Dee's value of 9.76 is the most accurate, Airs
W
~
~.I
~,~~ ':
value of 9.43 is sec.ond,Beth.'s value of 9.2~ ~s third, and Carl's value of 8:74. is the.
leas~ ~ccurate. . Using the size of th~ deviations fro~ the mea~ as a crlterlo~ f~r
pr~clslon, one finds t.hat Carl's value. IS the most precise, Dee's IS second, Beth s IS thIrd, a~d Airs value IS th~ lea~t precise. StUd: the data very carefu~ly to be sure that There IS an
II_'~~
~"".:::;
~~.
..
..
the basIs for thes,e co,nclu~lons IS cl.ear fro~ the Ideas that have been given thus far..
In fact the situation IS not quite so simple as has been presented.
I
"',,'
'
"
measurement appears to be very accurate, and the precision is must be considered. ari~es interplay between the concepts of accuracy but precision which poor, the question If a
whether or not the results are really meaningful.
only differs from the true value
a
~"~'?
"
;:
"""",
However,
-':~.'Cc ,"'~,\
have deviations
deviations are much larger than 0.37. It seems much more likely then that Airs mean of 9.43 is due to luck than to a careful measurement. It seems likely, however, that Dee's
. .. l'C"
",
C
,0 ~;;
mean of 9.76 is meaningful because the deviations of her individual measurements from the mean are small. In other words, unless a measurement has high precision it cannot" really be considered to be accurate. Airs data cannot really be taken to be as accurate as his mean implies because his precision is not good enough to warrant faith in that mean. An examination of the significant figures given in this data leads to essentially the
,o,~~
same evaluation of each student's data. Consider Alfs data which indicates by the values stated for the individual measurements are significant. that two places to the right of the decimal point On the other hand, Dee's results However, that conclusion is not supported by the fact that his deviations
occur in the first digit to the left of the decimal point. the fact that two places to the right of the decimal but Airs results are questionable.
show deviationsin the second place to the right of the decimalpoint in agreementwith
are given as significant in the
measured values. Thus from another point of view, Dee's results are seen as meaningful,
Carl's results, on the other hand, are an example of a situation that is common in the interplay between accuracy and precision. Carl's precision is extremely high yet his accuracy is not very good. When a measurement has high precision but poor accuracy, it is often the sign of a systematic error. Consider now this important class of errors.
"
8; Systematic Errors
Systematic errors are errors that tend to be in the same direction for repeated
;""~
measurements, giving results that are either consistently above the true value or
.
N
7,
,consistently below the true value. In many cases such errors are caused by some flaw in
the experimental apparatus. For example, a voltmeter could be incorrectly. calibrated in ~
,.
,; .
such a way that it consistently gives a reading that is 80% of the true voltage across its input terminals. It is also possible to have a voltmeter with a zero offset on its scale which is assumed for this discussion to be 0.50 volts. In the first case the error is a constant fraction of the true value (in this case 20%), and in the secondcase the error is a constant absolutevoltage. Either one of these is a systematicerror,and the answer to the question of which one is worse, depends upon the magnitudeof the voltage to be measured. If the voltage to be measured is 1.00volts, then the meter with absolute error of 0.50 volts causes an error of 50%, while the meter with relative error causes an error of 20%. On the other hand, if the voltage to be measuredis 100volts, the meter with absolute error of 0.50 volts causes only a 0.5% error, while the other meter still causes a 20% error, or in this case, 20 volts. Also note that if this
.:~i
.~-
measured voltageis usedto calculatesome other quantity, it too will show a systematic
error in the results. It is difficult to deal with systematic errors caused by equipment problems in the time constraints of this laboratory. One cannot afford to spend a great deal of time each week assuring that all of the equipment has little or no calibration error. On the other hand, it is always a mistake to simply assume that all the equipment in the laboratory works as it should and is well calibrated. The same attitude mentioned previously of healthy
measurements
:\:
skepticism
made
is appropriate
by two groups on
for dealing
the same
with equipment.
will otten serve to
A few quick
at least prove ~
object
that both group's equipment seems to perform the same or nearly the same. This does not confirm that all is well, but it gives some confidence that the equipment is satisfactory. If there are large differences be.tween the results of the two groups, further investigation is in order. When faulty equipment is suspected ask the instructor for help in either repairing or replacing the equipment. The general question of the status of equipment in these laboratories is a constant problem apart from the question of systematic errors. It is worth noting here that students often have a very negative feeling about the laboratory unless all the equipment works perfectly. It is a common attitude for students in this laboratory to feel that if their equipment is not perfect, then they are absolved of any responsibility to perform the experiment. Instead of taking that attitude, it is more productive to be aware that
dealing with equipment including problems is one of the most level. important aspects of all physical to
.-
"'"
'.
~
~,.
This laboratory
is a good place
is failure This
into account
important
be a problem
done
in this laboratory.
factors need to be considered, but may not have the ability to quantitatively take them
into account. For example,when using an air table to validate Newton'sLaws, as will be . 8
-~-
-'
..
,A:
done in this laboratory,it is commonto ignore friction. This is done becausefriction is assumedto be small, but also becauseoften there is just no way to easily determineits contribution. It is expected, therefore, that neglecting friction will introduce a systematic error. Although it may not be possible to eliminate a systematic error in this case, the experimentcan be performedin such a way that any systematicerror is minimized. For this example, simply choose the forces that are applied to any mass to be as large as possible, consistentwith any other constraintsof the experiment. If such appliedforces are large compared to the frictionalforce, the smaller is the systematicerror. As previouslymentioned.Carl's resultsfor the measurementof g seem very likely to indicate a systematic error. His precision is very high, but all of his individual
measurements are about 11% below the true value. In such cases one will very often be
asked to state whether
In attempting to decide if, for example, friction might be the cause of the error, it is important to consider whether the assumed source
would tend to produce results that are too high or too low.
of g was done by directly measuring some acceleration,
friction would tend to cause the results to be too low, and in that case, it is reasonable to suggest friction as a possible cause of the systematic error. On the other hand. if one's measurements for the same experiment showed the same kind of precision as Carl's, but
~
-
11.37,
it is not possible that friction is the problem because the errors will usually be errors to the extent error and suggest
error is in the wrong direction. For purposes of this laboratory the concern with systematic twofold: possible; (I) to attempt to eliminate any obvious systematic
9. Random Errors
The final class of errors are those which are produced by unpredictable and unknown variations carefully estimate l
~~"';
process
as
The variations
the last digit the same way every time will definitely can be caused by fluctuations
Other variations
mechanical vibrations or any of1he many physical variations that may be inherent in
the equipment or any other aspect of the measurement process. ~~f"ti It is important to realize
the following difference between random errors and personal and systematic errors. In principle all personal and systematic errors can be eliminated. but there will always
remain some random errors in any measurement. Even in principle the random errors
"c.c';"':';;cJ c""':~~,, ~
9
.~;j4'~;:' , '..' ~'.'. .
..
."can never be completely eliminated. This might lead one to falsely conclude that random errors are always the largest ~ contribution because the other kinds can be eliminated in principle. However, as has and in fact, errors been said before. in practice systematic errors are not easily eliminated, may be the largest source of error in the experiment. errors may be the largest contribution unreliable systematic estimates can be made of some systematic ~'-
there are many published examples in scientific journals today where systematic
errors is that there is no prescribed process to detect them or to determine in a prescribed way. This is
their size. Random errors, on the other hand, can be determined true because according function, mean. it has been found empirically statistical that random function to a particular distribution
errors often are distributed called the Gauss distribution about a peak at the
curve distributed
symmetrically
a bell-shaped
histogram of the frequency distribution simply means a graph of the number of times the measurements fall within a certain range versus the measured values.
~ 10.
Assume a series of repeated measurements is made in which there are no systematic or personal errors, and thus only random errors are present. Assume there are N statistical theory measurements from 1 to N. value. equation made of some quantity x, and the ith value obtained is xi where i varies If it is true that the errors are normally mathematical distributed, to the true
says that the mean of the above N measurements is the best approximation In formal
x=
(1-) f N
Xi
(1)
L
1
(or in other words addition) process in which the sum is taken from i=1 to 18.60, 19.30, 17.70, and 20.40. Equation (1) is simply shorthand
i=N. For example, assume four measurements are made of some quantity x, and the four notation for the averaging process given by
10
~;!"~,~:,t::"";):,"~#;:~:c,
"co.
~'":
,
(2)
-. ~
-(1/4)(18.60+19.30+17.70+20.40)=19.00.
""
The fact that the mean is the best approximation the true value is not surprising to and seems intuitively reasonable. That the mean is indeed the proper choice can be proved mathematicallyby somethingcalled the principle of least squares which can be stated in the followingway. The mostprobablevalue for some quantity determined from
a s~ri~s of measurements is that value which minimizes the sum of the squares of the
18-.
c c
;,~~"::
. .
~,
~ --
':';~
,.c"
!"';.~
.I'."R.
Sta..isticaltheory, ~urthermore, s..atesthat the precision of the measurement can be determinedby calculationof a quantitycalledthe standarddeviationfrom the meanof the
measurements. The symbol for standard deviation from the mean is 0", and it is defined by the equation "\lr':~~:;~'~::'[~~~;];-N 2
0' = -I,[Xj-X]
~.~,
---
.~~...",,";,.
~~T"'
, ,~ '
N-1
(3)
~~
,~
.II
~ ""
P
;;""" "~
For th.e data given, the standard deviation is calculated from equation (3) to be the following:
0"=
_.1.""
~
~
1, ~18.6-19.0) 4-1
+(19.3-19.0)
+(17.7-19.0)
+(20.4-19.0)
= 1.14
precision of the measurement in the following statistical sense. It gives the probability that the measurements fall within a certain range of the measured mean. From the sample standard deviation and tables of the standard error function, it is possible to
. .~
11111"1
determine the probability that the meas.urements within any desire.d~ange fall about the mean. The commonrangeto be quotedISthe rangeof one standarddeviationas calculated
by equation (3). Probability theory states that approximately 68.3% of all repeated measurements should fall ~ithin a range of plus or minus 0' from the mean. Furthermore 95.5% of all
repeated
,.,.
For the
example given above, 68.3% should fall in the range 19.00 :t 1.14 which is from 17.86 to 20.14, and 95.5% should fall in the range 19.00:t 2.28 which is from 16.72 to
.
"c~'
21.28.
c~~~
Actually to really make meaningful statements about the expected range, one
should do more than four repeated measurements. Something like 10 to 20 measurements would probably be needed to really determine the expected range.
.
-'c"
As a final note on the expected distribution for measurements that follow a normal
error curve,
ThiS.
11
~~~~,~",~:;:;""~!!-;.:-},,,c
.,~
:~~
c,~,,'"
"!'~~~","
" "-
is 3<J or farther
it is very
~
.:..
A second issue that can be addressed by these repeated measurements is the precision of the mean. standard After all, this is what is really of concern, of the mean. since the mean is the best standard deviation estimate of the true value. The precision of the mean is indicated by a quantity called the deviation Note carefully that the previous considered was the standard deviation from the mean. The standard deviation of the mean whose symbol is Om. is defined by Om = ~ YN where 0 is the standard deviation measurements. The significance producing
;~;,,;c,:"~
(4)
of repeated
For the example given above with 0=1.14 and N=4, one gets om=O.57. of
Om
a value
19.00:!:O.57.
~~";"-'
~;";;;,,,.,
probability that the true value lies in this range. Of course all these statements valid only if there are no other errors present other than random errors.
~" "
"c
In this laboratory students will often be asked to make repeated measurements of some quantity and to determine the mean. Normally when the mean is given as the measured value, the standard deviation of the mean, Om, should be quoted as the error in "
that mean.
~.:
~;,
'.
~"'~~'cc.,'
,
~..
z (meters)
~ 7.57 11.97
16.58
t (seronds)
1.00 2.00
3.00
4.00 5.00
From these data the average speed over each time interval can be calculated. The averagespeedv over some time interval At during which a distance interval ~z was traveled is given by v =~ (5) ~t For the data of Table 2 there are four intervals for the five data points giving for the first two intervals.
V, = 11.97- 7 .57 = 4.40 m/sec 2.00-1.00 V2 = 16.58-11.97 3.00-2.00 = 4.61 m/sec
The other two intervalsgive average speeds of 4.42 m/sec and 4.49 m/sec. A basic question is on what basis was the decision made to expressv,. for example, as 4.40
~~ -
rules for significantfigures to include calculations. These calculational rules have been introduced here, rather than in the section on significantfigures, in order to emphasize that these rules are related to error propagation. The followingrules should be used to determinethe numberof significantfigures to retain at the end of a calculation: (1) When adding or subtracting, figures to the right of the last column in which all figures are significantshould be dropped. (2) When multiplying or dividing, retain only as many significant figures in the result as are containedin the least precise quantity in the calculation. (3) The last significant figure is increased by 1 if the figure beyond it (which is dropped) is 5 or greater. These rules apply only to the determinationof the numberof significant figures in the final result. In the intermediate steps of a calculation, one more significant figure should be kept than is kept in the final result. Consideras an examplethe additionof the followingnumbers:
~,;"
.
!~,
",Q'!&f"J:;!? ."
13
,..,;.c,*,~,;.~-y,c""
i;;':i'~~..c""
':'~';';'?;".C
:j~~,"r
-~
.753.1 37.08
0.697
" .-
753.1 37.1
~ 0.7
56.3
847.177
56.3
847.2
.;
Following the above rules strictly implies rewriting each number as shown at the
right in which the first digit beyond the decimal true because that column is the rightmost is the least significant digit. This is column in which all digits are significant.
Note that one gets the same result if the numbers are added on the calculator (as done at the left), and then it is noted that the first digit beyond the decimal is the last one that can be kept. Therefore 847.177 is rounded off to 847.2. The same is true for multiplication and division. Consider the calculations below. 327.23 8.90906
12019.158
X 36.73
36.73
-- --
I --- -I 327.23
In each case the result is rounded to four significant figures because the least significant number in each calculation (36.73) has only four significant figures. For the multiplication the result is 12020, and for the division it is 8.909. '"
Consider again the determination of average speed from the data of Table 2. The
reason that the results were given to three significant figures is clear from the application of the above rules. In the subtraction processes to obtain the 6zs and the 6 ts, there are three significant figures for each of those quantities when the subtraction rules are applied. The average velocity is then calculated by dividing a 6z, having three significant figures, by a 6t, also having three significant figures. Therefore, the result has three significant figures. If, for example, the original data for the time had only been given to two significant figures as 1.0, 2.0, 3.0, 4.0, and 5.0 seconds, the resulting speeds would have been 4.4, 4.6, 4.4, and 4.5 m/sec. It is difficult to overstate the importance of properly taking significant figures into account in all calculations done in this laboratory. More points on laboratory grades have probably been lost by more students throughout the years over the issue of significant figures than any other single concern. Most instructors have absolutely no sympathy for failure to adhere to these rules. The wise student will learn these rules for significant figures and apply them without fail. The second case to be considered concerning the propagation of errors occurs when calculations are to be done on measured quantities to derive another quantity, but now it
..
is assumed that there is specific knowledge of the errors in the original measured quantities. ~
14
:CC"';'.c,~~",~~,".
..
.~
Suppose variables y, and z are measured, their means V, Z, and standard x. and X,
deviation of the meansCJmx' CJmy' CJmz determined. SomequantityA(i,y,Z) is and are calculated from the mean values X, y, and z. The question is what is the standard
deviation of the mean, CJmA. the calculated quantity A in terms of amx, amy' and CJmz. of
;1-'"---
Obviously this relationship can be generalized to any number of variables. Those who have studied calculus recognize the symbol aA/ax as the partial derivative of A with respect to x. The other symbols are defined in a similar manner. It is understood that somepeopletaking this coursehave studiedcalculus,and some have not. Those who have studied calculus should be able to derive an equation for CJmAfor any
,
r" , r'
,:. " ,
,..
.,
c,,- -,
(CJmA)2=
y2(amx)2 + x2 (CJmy)2
~ Those who have not studied calculus should not be intimidated by its use here.
used to show the origin of these equations for the benefit of those who can use calculus to derive the appropriate equations needed equation for other cases of interest. in the laboratory instructions In practice will be given in a form that only
requires the use of algebra. The point to these equations is to give the precise answer to the question of how errors propagate for the case in which the errors in the measured quantities are known. The rules for calculating with significant implied by equation (6). figures which were given earlier are only an such approximate way to handle error propagation in the absence of more complete knowledge On the other hand for most cases in this laboratory. detailed knowledge of the errors is lacking, and the significant figure calculations are the best way available to estimate the precision of the result.
! I' (",',
I'
",
,-
..
".;!,
r--"
.,
In many
cases of interest it is assumed that there exists a linear relationship between the two
15
.:"
~"
c?11t!{i.~'.~
"c;c,~...c.",~
~..
","","~;c,!:",
variables. In mathematical terms one can say that the variables obey an equation of the
form ~
.~
..
y = mx+ b (10) ~ where m and b are constants. This also implies that if a graph is made with x as the ~
abscissaand y as the ordinate,it will be a straight line with m equal to the slope (defined as 6y/6X) and b equal to the y intercept(the value of y at x=O). The question is how to best verify that the data do indeed obey equation(10). One way is to make a graph of the data, and then try to draw the best straight line possible through the data points. This will give a qualitative answer to the question. but it is possibleto give a quantitativeanswerto the questionby the processdescribedbelow. The measurements are repeated measurements in the sense that they are to be considered together in the attemptto determine to what extent the data obey equation (10). It is possible to generalize the idea of minimizing the sum of squares of the deviationsdescribedearlier for the mean and standarddeviationto the presentcase. The result of the generalizationto two variable linear data is called a linear least squares fit to the data. It is also sometimes referredto as a linear regression. The aim of the process is to determinethe value of m and b which producethe best straight line fit to the data. Any choiceof values for m and b will producea straightline, with values of y determinedby the choice of x. For any such straight line (determined by a given m and b) there will be a deviationbetweeneach of the measuredy's and the y's from the straight line fit at the value of the measuredxis. The least squaresfit is that m '" and b for which the sum of the squares of these deviations is a minimum. Statistical ~ theory states that the appropriate values of m and b that will produce this minimumsum ~ of squaresof the deviationsare givenby the following equations:
N
N L
1 XiYi
-( L
1
xJ( L Yi)
1
m=
~
f"
(11)
N
"
R,
7 "'" ""
L Xi - ( L xJ
1 1
~"
'"
~
... C
Ic" \t~,~~~~1; .,
N N (L yJ(L
Xi
)-(
L
N 1
XiyJ(
L
N 1
xJ
(12)
cc~:"v,-'"
ff~~
b-
NL Xi2_( L
1 1
XJ2
Here N is the numberof data points,xi and Yi are the measuredvalues, and L stands for
1
the summationfrom i=1 to i=N. Refer again to the data of Table 2 for co-ordinatepositionversus time. The question to be answeredis whether or not the data are consistentwith constant velocity. It is not~
16
,
;
r
,
assumed that z=O at t=O. This is the realistic case for much of the data taken in this laboratory. intervals determined. Often the experiments are done in such a manner that there are fixed time (provided by a spark-timer with fixed frequency), but t=O is arbitrarily For these conditions, if the speed v is constant, the data can be fit by an
z=vt+Zo
Equation (13) is/of the form of equation (10) with z corresponding of z versus t, and Zo will be the intercept. arbitrarily leads to,
N N N
(13)
to x, v corresponding to m, and Zo corresponding to b. Thus v will be the slope of a graph which is the co-ordinate into equations
~~
"".",""f~~;,..,'.;:,'o
"
(14)
1 ~'r;"c"""~",,,~,r"!i:J~~'~C~',~~
"~"""'"
--C"'""'-~~-'-"""::";;:"C"!"c"",
1 N L
N N (L Zj)(N t 2)-( L L
tiZj(
t) (15)
"a ~
" "~~
20 ~
1 N 1
1 N 1
NL t2-( L 11)2 Calculating some of the individual terms gives: Lti = 1.00 + 2.00 + 3.00 + 4.00 + 5.00 = 15.00
Lti2 Lzi Ltizi = (1.00)2 + (2.00)2 + (3.00)2 + (4.00)2 + (5.00)2 = 55.00
= 7.57 + 11.97 + 16.58 + 21.00 + 25.49 = 82.61 = (1.00 X 7.57) + (2.00 X 11.97) + (3.00 X 16.58) + (4.00 X 21.00) (5.00 X 25.49) = 292.70
LZj2
= (7.57)2
+ (11.97)2
+ (16.58)2
+ (21.00)2
+ (25.49)2
= 1566.22
in equations (14) and (15) gives v = 4.487 and Zo = 3.061. to be 4.487 m/sec, and the co-ordinate position at t=O
At this point the -best possible straigh1 line fit to the data has been determined by the least squares fit process. A second goal remains, and that is to determine how well the
-:
'
data actually fit the straight line that has been obtained. Again a qualitative answer to this question- is obtained by making a graph of the data and the straight line and .. . . . qualitatively Judging the agreement between the line and the data. There is, however, a quantitative measure of how well the data follow the straight
17
~""c (
"
..~
line obtained by the least squares fit. It is given by the value of a quantity called the
correlation coefficient, r. This quantity is a measure of the fit of the data to a straight \ line with r=1.00 exactly signifying a perfect correlation, and r=O signifying no correlation at all. The equation to calculate r in terms of the general variables x and y is given by
N N
1
N
1
(16)
2N 2 1
2N
Xi
2 1
IN 1
.,.J N2,
1
versus time implies substituting t for x and z for y in the above equation to get
N N N
r=
'~"~~-i.'FC'c
(17)
,j NI ti 2_( f
1 1 .; c.;
t)2
,j NI
1
Zj2_(
f
1
ZJ2
Using the appropriate numerical values calculated earlier in equation (17) gives r=0.9998. Thus the data show an almost perfect linear relationship since r is so close to 1 .00. It is extremely important to study this example carefully and to completely \
understand the process of linear least square fitting of data. A very high percentage of the laboratories in this course will involve two variables which are linearly related. required. For these cases a least squares fit to the data will usually be and mean and standard deviation Although the least squares fit calculations
calculations are not difficult in principle, they are tedious and time-consuming. Many hand-held calculators have automatic routines built in that allow the calculation of these quantities that many by simply inputting will the data points one after another. perform mean and standard It is advised that students purchase such a calculator if they do not already own one. calculators automatically. but fewer calculators Be careful to note calculations
deviation
will also perform an automatic linear least squares Also note that most calculators will calculate
.
fit (or it may be called linear regression). two different standard deviations.
and it is the
sample standard deviation. Also available on most calculators is a quantity which is usually denoted as aN. II applies to the case when the population is known, and it will never be appropriate for data taken in the laboratory. Always be sure to choose the
quantity
N-1 which is the one defined by equation (3). When performing a least squares fit to data, particularly when a small number of data points are involved, there is some tendency to obtain a surprisingly good value for r even for data that does not appear to be very linear. For those cases the significance of r ~
18 .